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Preface 


Durinc the academic year 1948-1949 a set of mimeographed notes entitled 
"Designs for Research and Observational Experiments" was prepared for a 
series of lectures at the Geneva Experiment Station, Geneva, New York, and 
for class use in a course on the design of experiments at Cornell University. 
The original notes were revised and expanded to obtain the material for this 
text; this material is used for the present course on the design of experiments 
at Cornell University. 

Facts concerning experimental designs are scattered throughout statistical 
literature, and no single text suffices entirely for a course on the design of 
experiments. The basic reason for writing this book was to present subject 
matter and techniques not available in other texts. À comprehensive coverage 
of the design and the analysis of experiments is given. The analyses of designs 
with missing observations, the analyses for variations from standard designs, 
components of variance analyses, and covariance analyses are treated in 
detail. Extensive use is made of numerical examples. Problems and literature 
citations to numerical examples are given for the various designs. A unified 
approach is followed for confounding in factorial experiments and for the 
analysis of incomplete block designs. 

The design and analysis for a particular experimental design is presented in 
the first part of a chapter. This is followed by variation in the basic design. 
The material on components of variance is presented in the last part of a 
chapter. With the material arranged in this fashion it is possible to teach 
several types of courses from this book. The most elementary course would 
utilize the material in Chapter I, the first parts of Chapters IV, V, and VI, 
Chapter VII, and the first parts of some of the remaining chapters. A more 
comprehensive course would make use of the material in other sections of the 
book. Also, a short course on variance component analyses would use the 
material presented in the last sections of Chapters IV, V, VI, VIII, X, XI, 
and XVI. 

The student will need to understand basic statistical analyses and concepts 
as described in the texts by Professors С. W. Snedecor, Statistical Methods, 
and R. A. Fisher, Statistical Methods for Research Workers. The latter reference 
is somewhat more advanced than is required for an understanding of most 
parts of the present dissertation. Full comprehension of the mathematical 
theory underlying the analyses of the various designs requires mastery of the 
principles of calculus and matrix algebra. However, the material on variance 


vii 


viii Preface 


component analyses is presented in such a fashion that the manipulations 
may be performed with an understanding of college algebra. 

In general, the 5 per cent level of significance is used throughout the text. 
Whether or not this is a desired level depends upon the stage of the research 
program, the possible economic gains or losses resulting from the research, 
and the number of replicates used. In the early stages of a program, the 
experimenter may be investigating a number of treatments to determine if 
they should be tested further. Material for more than one, two, or three 
replicates may be unavailable, and the experimenter may wish to use a 10, 
15, 20, or 25 per cent level of significance test in order to have a type II error 
(error of failing to detect a specified difference when the treatments actually 
differ by more than this amount) of reasonable size. In the final stages of a 
program, the experimenter may wish to use a one per cent level of significance 
test and to use enough replicates or samples to have a type II error which is 
one per cent or less. 

The author wishes to express his thanks and appreciation to J. M. May, 
E. S. Pearson, H. O. Hartley, and the Biometrika Trustees for permission 
to reprint the material in table 11-1; to D. B. Duncan and the editor of Biomel- 
rics, G. M. Cox, for permission to reprint the material in table II-3; to T. F. 
Kurtz, В. F. Link, D. L. Wallace, and J. W. Tukey for permission to reprint 
the material in table II-4; to D. B. Duncan and to the editor of the Virginia 
Journal of Science, Boyd Harshbarger, for permission to reprint the material 
in tables 11-5 and 11-6; to M. Merrington, C. M. Thompson, E. S. Pearson, 
H. O. Hartley, and to the Biometrika Trustees for permission to use the 
material in table II-8; to J. F. Crow and to the editor of the Journal of the 
American Statistical Association, Allan Wallis, for permission to reproduce 
figure II-4; to M. Harris, D. С. Horvitz, A. M. Mood, and to the editor of the 
Journal of the American Statistical Association for permission to reprint 
tables III-1, III-2, and a part of III-3; to E. Lehmer and to the editor of the 
Annals of Mathemalical Statistics, E. L. Lehmann, for permission to reprint 
the material in table III-4; to C. R. Henderson for permission to use the 
material in section VIII-5.3; to W. G. Cochran for the material in section 
XI-8.1; to M. L. Richards for the analysis in section XIV-2.4; to G. M. Cox 
ci the analysis in section XV-3; to D. B. Duncan for some of the problems in 
4 apters X and XI; and to the numerous other experimenters whose work 

аз been used. 
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Cochran, G. W. Snedecor, R. A. Fisher, and O. Kempthorne for my present 
knowledge of the design of experiments. Numerous references have been 
made to the writings of these individuals. 

For their many helpful comments on the first set of mimeographed notes, 
I wish to thank R. G. D. Steel, W. J. Drapala, J. H. Torrie, and students in 
my classes. The constructive comments and searching queries of D. S. 
Robson, H. Fairfield Smith, Prasert Na Nagara, and C. R. Henderson were 
of great value in preparing the final manuscript; these individuals spent a 
considerable amount of time in reading the manuscript and their aid is 
greatly appreciated. Several other individuals, J. E. Dowd, W. H. Leonard, 
A. G. Clark, E. E. Houseman, H. L. Everett, D. D. Mason, H. L. Lucas, 
and other associates, offered helpful comments on parts of the mansucript. 
J. W. Tukey, D. B. Duncan, and F. J. Anscombe were kind enough to read 
chapter II; their comments were greatly appreciated. 

The diligent efforts of Mrs. Helen Resnick, Mrs. Alison O. Foster, and 
Mrs. Joan Pearsall in preparing the manuscript and the tables for publication 
are greatly appreciated. Thanks are due to Mrs. Helen Resnick for aiding 
with the proofing of galleys and page proofs and with indexing. Especial 
thanks are due to Mrs. Lillian E. Federer for her diligent efforts in checking 
the manuscript, in proofing the galleys and page proofs, and in the preparation 
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EXPERIMENTAL DESIGN 


СНАРТЕК 1 


Introduction 


.. All fields of research have at least one feature in common, i.e., the vari- 
ability of experimental material. The universality of heterogeneity has been 
known and discussed at considerable length in the literature (e.g., in the writ- 
ings of J. Arthur Harris [142-146] from 1913 to the early thirties). The vari- 
ability in experimental material may vary in size, but its omnipresence cannot 
be denied. When the variability is small relative to the group or class differ- 
ences and the observations are not costly, an elaborate experimental design is 


‘not required. The experimenter simply takes the results from a large number 


of readings and observes the relationships or mean differences. For example, 
it may be a simple procedure to take ten thousand observations on each level 
of a factor. The variation around a mean for a given level may be made as 
small as desired simply by taking more observations. А. curve may be fitted to 
the several means, and the curve is essentially the theoretical one. 

When there is considerable variation from observation to observation on 
the same material and it is not feasible to take a large number of observations, 
the experimenter is forced to refine his experimental techniques and (or) to use 
an experimental design which allows for unbiased estimates of the true treat- 
ment differences with a specified degree of precision. It is necessary that a 
probability statement be attached to treatment differences if inferences be- 
yond the data are to be made. (The term treatment is a general one and is used 
to denote a class, category, or group.) In order to attach a probability state- 
ment to estimated treatment differences, Fisher [126] has stated that randomi- 
zation and replication are two required conditions for an experimental proce- 
dure or design. This also means that randomization and replication are neces- 
sary to obtain a valid estimate of the error variation. A valid estimate of the 
error variance (or experimental error variance) makes possible statements con- 
cerning the percentage of errors committed by the experimenter when he states 
that treatment means are different when the differences are actually chance 
sampling fluctuations and that treatment means do not differ when they 
actually do differ by more than some specified value. 

Regardless of the size of relative variability, there are certain principles 
of scientific experimentation that should be followed. Perhaps it is not the 
place of the statistician to stress these principles, since most of them are 
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nonstatistical [72]. However, a consulting statistician is continually faced 
with statistical problems incurred because the experimenter failed to observe 
one or more of the basic principles. The progress of research in any field will 
* be greatly accelerated if the questions whose answers are sought, probable 
results, design of the experiment, selection of the entries to be tested (the 
treatments), the method of analysis, presentation of results, etc., are closely 
studied before the experiment has been conducted. 


I-1 The Principles of Scientific Experimentation 


The principles involved in scientific experimentation have been stated in 
many ways and in many places. A few of the books written on the subject are 
cited at the end of this book [41, 63, 89, 167, 245, 312]. The books by Wilson 
[312] and Churchman [41] are recommended for reading since they include a 
discussion of scientific experimentation in light of present statistical knowl- 
edge. Also, articles on experimental methods, on statistical analyses, and on 
the interpretation of results appear in a number of scientific journals [e.g., 2, 
170, 250, 274]. Goulden [135], Love [196], and Leonard and Clark [193] have 
enunciated these principles for agronomists. The principles of scientific experi- 
mentation are listed below and are discussed individually. 

1-11 FORMULATION OF QUESTIONS TO BE ANSWERED AND HYPOTHESES 
TO BE TESTED 

A scientific experiment should be set up to answer a specific question or 
questions. Precise formulation of the question (or questions) to be answered 
enables the experimenter to state his hypotheses in more precise terms and to 
plan his experimental procedure more effectively. Clear and precise questions 
and hypotheses at this stage enable the experimenter to proceed rapidly to the 
next step. It is advisable to have a statement of the questions and hypotheses 
in written form. 

A hypothesis is a statement about the parameter or parameters in one or 
more populations. A null hypothesis is a statement of no difference between 
the parameters involved. The null hypothesis can never be proved by any 
finite amount of experimentation [126, sec. 8]. Given a particular null hy- 
See with no specific alternative hypothesis, the experimenter either rejects 
oF Coes not reject the null hypothesis. The term “does not reject” does not 


ae е of the null hypothesis for the case where no specific alterna- 
Ы on stulated. Experimental evidence can lead to the rejection of the null 
Ypothesis but not to its acceptance. 
_ Fisher [126, sec, 8] states that the above i 
simultaneous consideration of several h 
volving statistical ‘estimation.’ ” Spec 


deas may be extended to the 


— 


$ 1-1.2] Principles of Scientific Experimentation 3 


the hypothesis of no difference, say Ho, may be accepted. Given a rule of pro- 
cedure for accepting or rejecting a null hypothesis, it is possible to calculate the 
errors committed when any one of the hypotheses is true. Rules of procedure 
generally fix in advance the probability of error of rejection when the null 
hypothesis is true. Then, one is able to calculate the errors associated with 
alternatives. 

If a null hypothesis, Ho, and an alternative hypothesis, say Hi, are post- 
ulated by the experimenter, two kinds of error may result. The first kind of 
error (type I error) arises when Ho is rejected for a given experiment when 


Hp is actually true. The second kind of error (type II error) arises when H is 


accepted when it is actually false and Hi is true. 

Null hypotheses and alternative hypotheses should be formulated precisely 
for each experiment. Clear thinking about the problems involved in each ex- 
periment is necessary in setting up hypotheses, and care-should be used in 
selecting the appropriate tool for testing a hypothesis. Various statistical 
methods have been developed for testing hypotheses. A statistical test or a 
test of significance of a hypothesis is a procedure whereby the hypothesis is 
rejected or is not rejected, with sufficient explanation of what “not rejected” 
means, "Not rejected" may mean a qualified acceptance until sufficient evi- 
dence accumulates. Statistical tests for various hypotheses are given through- 
out the following chapters. 

Several tests of significance may be available for testing а given hypoth- 
esis. A good test is one which minimizes the probabilities of all errors over all 
possible values of the parameters. Some tests are better than others for a given 
alternative hypothesis but are not as good for other alternatives. In order to 
compare two tests for two hypotheses, the procedure is as follows. First select 
a type I error (say .05). Then compute the quantity, } — probability of a type 
II error, which is called the power of the test, for each test; the test with a smaller 
type II error, or a higher power, is the better test. 

The above discussion contains the general ideas concerned with testing 
hypotheses. For a fuller treatment of this topic the reader is referred to the 
books by Mood [212, Ch. 12], Wald [299, 300], and Mann [207, Ch. 6]. 


I-L2 A CRITICAL AND LOGICAL ANALYSIS OF THE PROBLEM OR PROB- 


LEMS RAISED 

After formulating the hypotheses, the experimenter should critically and 
logically evaluate them. А review of pertinent literature isa valuable aid in 
evaluating hypotheses. The reasonableness and utility of the aims of the ex- 
periment should be carefully considered. After a critical evaluation of the 
hypotheses in step 1, the experimenter may find it advisable to reformulate 
the questions and hypotheses before proceeding with the experiment. j 

In order to utilize statistical tools such as Statistical Decision Functions 
[300], the experimenter is required to evaluate the possible outcomes of the : 
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experiment in terms of relative costs. This step involves more time spent in 
considering the hypotheses in terms of various outcomes. Consideration of 
possible outcomes and costs is necessary if one is to make use of new and power- 
ful statistical tools. The experimenter needs to provide answers to such ques- 
tions as: (i) “If the results are of such and such a nature, what do they mean 
to me in dollars and cents?,” (ii) “If the results are of another nature, what 
value do I ascribe to them?,” etc. 


I-1.3 SELECTION OF A PROCEDURE FOR RESEARCH 


The procedure to be followed depends to a large extent upon the field in 
which the research is being conducted. The selected experimental procedure in 
most fields of research will involve some or all of the following considerations: 


L— (i) the selection of treatments or entries to be included in the experiment, 
(ii) the selection of characteristics to be measured, 
L (iii) the selection of the unit of observation, number of replications, and the 


sampling or experimental design (The latter should be amenable to statistical 
analyses.), 


2—(їу) the control of the effect of adjacent units on each other or “competition,” 


L—(v) an outline of pertinent summary tables and probable results, 
£— (vi) an outline of analyses to be performed, 


(vii) a statement of costs in terms of material, personnel, equipment, etc. 


After considering the above items, the experimenter may decide to re- 
formulate the hypotheses, to change the experiment, or to continue to the next 
Step in scientific experimentation. The formulation of hypotheses and the 
selection of treatments are extremely important and closely associated consid- 
erations. A large part of the success of an experiment may depend upon the 
correct selection of treatments. Also, the selection of an appropriate experi- 
mental or sampling design is of considerable importance in the testing of 
hypotheses and in estimating treatment effects. 

I-1.4 SELECTION 


OF SUITABLE MEASURING INSTRUMENTS AND CON- 
TROL OF THE PERSONAL EQUATION 


In any experiment the experimenter must select measuring instruments 
that are sufficiently accurate 
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tions are not of equal value in their contribution to a treatment mean. For 
. example, a field plot or a laboratory sample for one of the treatments may be 
destroyed or damaged because of the carelessness of the technician; a group of 
the experimental animals [198] or plants may become diseased; adverse weath- 
er conditions may destroy or damage part of an experiment; etc. The question 
may arise as to whether or not zeros or abnormal values from the damaged 
experimental units should be retained in the statistical analyses or deleted prior 
to statistical analyses. In this regard, experimenters are urged to consider the 
following rule: Zf the observation contributes to the true treatment difference, il 
should always be retained. If the observation does not contribute to the true treat- 
ment difference, it should be discarded, provided that the reason for discarding the 
observation is one that would be held valid by the experts in that field. 

The appropriate measuring instrument should be selected prior to taking 
any measurements and should be retained throughout. Changes of measuring 
instruments should not be made in the middle of an experiment. 

The problem of controlling other personal biases may or may not require 

"elaborate precautions. Often some simple procedure suffices. For example, in 
the analyses of samples in the laboratory a random assignment of numbers to 
samples often provides unbiased results which are not possible when the tech- 
nician knows the identity of the samples. 

I-1.5 A COMPLETE ANALYSIS OF THE DATA AND THE INTERPRETATION 
OF RESULTS IN LIGHT OF EXPERIMENTAL CONDITIONS AND HYPOTHE- 
SES TESTED 

Statistical procedures are valuable aids in reducing the data to summary 
form. The results of an experiment must be interpreted in light of the statistical 
evidence obtained and the theoretical considerations of the subject under 
experimentation. The "interpretation" of the results of an experiment does 
not end with the calculation of a mean, of an F value, or of a confidence inter- 
yall In testing hypotheses and estimating effects, it is essential to use the ap- 
propriate statistical procedures. The resulting computations should be checked 


to insure against computational errors. 


11.6 PREPARATION OF A COMPLETE, CORRECT, AND READABLE RE- 
PORT OF THE EXPERIMENT 

sults of an experiment should be completely and carefully 
form. This may involve typewritten copies to a selected 
a printed report in published form. One should present 
data to allow others to test the various hypotheses. By 
giving an account of the experimental procedure and statistical methods (or 
references to statistical methods) used, the reader may decide for himself 
whether or not he considers the procedure sound. 


The question often arises about so-called "negative results,” i.e., results 
which would lead 


The pertinent ге 
reported in written 
group of individuals or 
enough of the summary 


one not to reject the hypothesis of no difference (the null 


6 Introduction [§ I-1.7 


hypothesis). If the experimental procedure is correct, there are no “negative 
results,” and a report should be made. The experimenter should offer a reason 
for not finding differences which he suspected were present. Explanations of 
"negative results" are often of considerable value. A careful report on such 
experiments may be of considerable value to other experimenters conducting 
similar research. 

It is conceivable that a principle could appear to be established if only 
"positive results" were published and if the reported results reflected merely 
discrepant sampling fluctuations and not true treatment differences. For ex- 
ample, medical researchers might be evaluating the effectiveness of two drugs, 
А and B. Suppose that A and B are equally effective but that the only experi- 
mental results published are those which indicate that A is better than B. As 
a result, drug A may be selected over drug B. In this case, no harm results 
to the patient, since the drugs are equally effective. If detrimental side effects 
of drug А over drug B are present, or if the cost of manufacturing drug A is 
higher, the failure to report all results would have undesirable consequences. 


1-1. STATISTICS IN RELATION TO THE PRINCIPLES OF SCIENTIFIC * 
EXPERIMENTATION 


It should be emphasized that the above principles of scientific experimenta- 
tion are not entirely statistical, nor does every step involve statistics. For, 
example, steps one and two are entirely nonstatistical and are completely 
within the scope of the field in which the experiment is performed. In step 
three (i), (ii), (iv), and (vii) are nonstatistical, (у) may or may not be statisti- 
cal, and items (iii) and (vi) are entirely statistical in nature. Step four is partly 
statistical. Step five is highly statistical, but step six is nonstatistical. Hence, 
the design of the experiment is only one item to be considered in planning and 


conducting the experiment and in interpreting and reporting the experimental 
results, 


I-2 Classification of Experimental Designs 
In planning any experimen 
periment is selected. The sele 


domized designs. Since 
domized designs can be 
ү п made, y ‹ i 
' onl a few selected examples of syste 


1-2.1 SYSTEMA TIC DESIGNS 
Pri r to the dey lo Г 1 t 5: 
10) €lopment of , 
А mode n experimental de 1gns. exper imenters 


ents which are not subject to the laws of chance. Syste- 
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matic schemes of arranging the treatments in the various repetitions have been 
devised. One such scheme is to arrange all duplicates, triplicates, etc. of the 
treatment together. Suppose the experimenter wishes to test three treatments, 
A, B, and C, and that he decides to have four repetitions of each treatment. 
The arrangement of the three treatments over the experimental area could be 


one of the following: 


AWANSAWAWESBEENXBEZBN[NCECHCEG 


A A А А 
сссс 


ppp 
шшщщ 
aoaao 


А А BUB c C 
АА: B B сс 


From fertilizer, yield, and other trials or experiments, it is evident that it 
might be better to test treatments A, B, and C together in a compact block 
and then to repeat these blocks. Before Prof. Fisher established the principle 
of randomization, a systematic ordering of the treatments in each block or 
repetition seemed natural. One of the more common types of systematic ar- 
rangements in which the treatments are repeated several times is the following: 


Replicate I Replicate II Replicate III 
A BC A BC | AUBIG 
or 
Replicate I A BC 
Replicate II A BC 
Replicate III A B C 


In this case the ordering of the treatments is exactly the same in every repli- 
cate (a unit which contains all the treatments). Another systematic arrange- 


ment is the following: 
Replicate I Replicate IT Replicate III 


A BC CXESBSCÓCOBOGNA 


8 Introduction [8 1-2.1 


In this case, each treatment occupies each order in the replicate. „ 
Another systematic arrangement proposed is the "diagonal square" [126, 
sec. 34]; for three treatments the design is 


In order to eliminate the effect of A appearing on one diagonal, a systematic 
arrangement involving the Knight's Move was proposed, i.e., one down and 


two over. This arrangement for five treatments in three replicates gives the 
following design: 


Replicate I 
Replicate II 
Replicate III 


e above design has been known in Denmark 
and called the Knut Vik Square B Abel to d perpe, CRUS 


c ystematic design i VEL 
tem with square-yard = gn is Dr. Beavan’s 


ots. Student [284] states that thi 
. . 2 
at Warminster prior to 1909. Beavan’s arrangement follow: 


chessboard” sys- 
design was in use 
s: 
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The above arrangement is repeated as often as necessary. 


Student [284] describes another arrangement, Beavan's half-drill strip 
method, that has been used extensively in field trials with cereals. Arrange- 
ment for the half-drill method follows: 


Numerous other systematic arrangements have been devised, with the 
various experimenters attempting to outguess natural variation. Regardless 
of the type of systematic design, they all have relatively the same advantages 
and disadvantages [326a]. The advantages are often given as 


(i) Simplicity. Many experimenters feel that planting, note-taking, and harvest- 
ing in agronomic trials are facilitated by using systematic arrangements. In 
judging or scoring experiments, it is sometimes felt that the judge will be 
better able to “discriminate” between the treatments if he (the judge) knows 
the order in which the treatments occur in the different repetitions. 

(ii) The systematic design provides‘ adequate" sampling of the experimental area; 
that is, it allows for “intelligent placement” of the various treatments. — 

Gii) Varieties may be arranged in order of maturity; fertilizer treatments may be 
arranged in order of increasing fertility; etc. 

(iv) It may be desirable to alternate dissimilar varieties (say, bearded versus 
beardless barley) so that natural crossing or mechanical mixtures can be de- 
tected in subsequent years. 

(v) There is no need to randomize, since the heterogeneity of the experimental 
site is such as to randomize the effects on the treatments. (This does not 
lessen the effect of one treatment on another or of a single arrangement; 


these facts should not be ignored.) 


The disadvantages of the systematic designs are: 


(i) there is no valid estimate of the variance; 
(ii) the correlation between adjacent plots may lead to systematic errors in asses- 


sing treatment differences. 
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The latter point is easily illustrated by the following systematic arrangement: 


Replicate I Replicate II Replicate III 
A BC | ABC | ABC 


where the yield gradient is assumed to exist from left to right. Even though 
treatments A, B, and C may be the same thing, the experiment would show 
А to be better than B, and B better than C. In the event that the treatments 
are different, their differences may be exaggerated or underestimated, depend- 
ing upon the arrangement of the treatments. 

Field experience has demonstrated that with proper organization the ex- 
perimental procedure in randomized arrangements is generally as simple as 
with systematic arrangements. It is often advantageous if the judge does not 
know the identity of the treatments, since the comparison of material with 
known identity often leads to biases. 

Fisher [126, sec. 27 and 34] discusses some systematic arrangements and 
their effect on tests of significance and on the estimation of an error variance 
[see 70]. Suppose that systematic arrangements are tried on uniformity trial 
data (plot data on the same treatment over the whole of the experimental site 
or area).! Then the treatments in the experimental area would all be the same 
thing. The total sum of squares (ss) would be a constant regardless of what 
arrangement was chosen. If the experimenter is able to place the treatments so 
that all are subjected to about the same heterogeneity, then the sum of squares 
due to the differences between dummy or pseudo varieties would be decreased. 
The decrease must be counterbalanced by an increase in the error or remainder 
sum of squares, since the total is a constant: 


total ss = ss among dummy var. + ss within var. 


If, on the other hand, the experimenter does a poor job of placing the dummy 
varieties, the estimate of the error sum of squares will be smaller than it really 
should be and the differences between the dummy varieties will be exaggerated. 
Some arrangements may consistently underestimate the error variance. The 
amount of underestimation is unknown, and any attempt to obtain an estimate 


of the error variance from systematic arrangements is pretty much a matter of 
guesswork [see 70]. 


1-22 RANDOMIZED DESIGNS 


Various classifications for randomized designs may be made. For example, 
such classifications as completely randomized (randomized over the whole of 
the experimental area), randomized complete block (randomization confined 
to allotment of all treatments within each of the several blocks), latin squares 
and variations, incomplete blocks (only a fraction of the set of treatments in 

The R METRIS T TR З " 

L d discum ceps шш ы ыша һу тегі nd Oe 


>. 


§ 1-2.2] Classification of Experimental Designs п 


each of the several blocks), and others could be used. One could extend this 
classification in the following manner: 


1A. Systematic—treatments laid out systematically by design 
1B. Randomized—treatments randomly allotted to the experimental units 
2A. Experimental area not subdivided prior to randomization 
—completely randomized 
2B. Experimental area subdivided prior to randomization 
3A. Complete blocks—all treatments appear together in a block 
4A. No additional restriction— 
4B. Randomized complete block 
—additional restrictions 
5A. Two restrictions 
-latin square 
—cross-over 
5B: More than two restrictions 
—graeco-latin square 
—hyper-graeco-latin square 
—others 
3B. Incomplete blocks—only & of the v treatments appear in any one block 
4A. Incomplete blocks not grouped into complete blocks by replicates 
(v treatments in blocks of k) 
—partially balanced 
—balanced 
—others 
4B. Other designs 
5A. Incomplete blocks grouped into complete blocks 
6A. n-dimensional (k” treatments) 
7A. Partially balanced 
one restriction 
—simple lattice 
—triple lattice 
—cubic lattice 
—quartic lattice 
—split plot 
—split block 
two restrictions 
—incomplete lattice square 
—semi-balanced lattice square 
—split split plots 
etc. 


n-restrictions 
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7B. Balanced (k — prime or power of a prime number) 
one restriction 
—balanced lattice 
two restrictions 
— balanced lattice square 


n-restrictions 
6B. pgrs...treatments 
7^. Partially balanced 
—rectangular lattice 
—split plot 
—split block 
7B. Balanced 
5B. Other designs 
6A. Incomplete latin square 
—Youden square 
—quasi-latin square 
—semi-latin square 
6B. Others 
—latin squares with split plots 
—rotalion experiments 


s. Thus, the choice of a design depends 
tested and the variability present. 
designs relative to Systematic designs 


is that they alyses such as the analysis of variance 


[127, 273]. 


I 


are subject to statistical ап 


-3 Selection of an Experimental Design 


The selection of an experimental desi 
tates knowledge of the variability of the 
the number of ; replicat i 


a selection 1 of more hi mogeneous material 

Ш) stratification of experimental Та] in 

MU) stratification of ех al material into homogene 

: OUS S , 
(iii) Increasing the number of Observations or deum Чоц 


ions, and 
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(iv) measurement of one or more related characteristics in order to use regression 
techniques. = 


The selection of homogeneous material may be relatively easy in that the 
experimental animals, insects, plants, etc. of a particular strain may be rela- 
tively homogeneous genetically. The variation among members of the strain 
or samples of the material may be small relative to the size of treatment 
differences to be detected. For such material the design listed under 2A of 
section 1-2.2 is suggested. 

If it is impractical to obtain a single homogeneous group of material, several 
small groups of homogeneous material are often available, e.g., several litters 
of animals in which the litter mates are similar but the litters differ, half leaves 
or opposite leaves on a plant, pieces of a cake or pie, lengths of a piece of lumber 
or steel, etc. In such a situation, one of the designs listed under 2B of the pre- 
vious section is suggested. The complexity of the design is determined to a 
large extent by the number and nature of treatments and by the sources of 
variation to be controlled. 

The selection of an experimental design is dependent upon the nature of the 
sources of variation in the experimental area or material. However, if an experi- 
menter is given one guiding principle in selecting a design, it might be to choose 
the simplest experimental design possible from a layoul and analysis standpoint 
which allows for adequate control of the variability. If a design presents pro- 
cedural difficulties and is costly to analyze relative to the cost of collecting the 
dala, a simpler experimental design should be chosen. 

One experimental design is said to be more efficient than a second if the 
error variance of a treatment mean is smaller for the first design than for the 
second design. In computing the efficiency of one design relative to another, 
the cost as well as the number of degrees of freedom may be considered. A fuller 
discussion of efficiencies of the various experimental designs is presented in 
later chapters of the book. Two designs with different numbers of replicates 
could have the same relative efficiency per dollar spent if the cost of the addi- 
tional replicates in one design equals the additional cost of analysis and addi- 
tional experimental procedural costs for the second design, i.e., on a per unit 
of information basis. The efficiency of one design relative to a second, for fixed 


cost, is 
(аа) / (S33) (I2) 


where s? — error variance per unit of observation, r — number of replicates, 
с = cost per replicate, df = error degrees of freedom, and the subscripts refer 
to the first or second experimental design. The correction for the difference in 
the number of degrees of freedom associated with the two error variances 


is necessary whenever the variances are estimated [126, sec. 74]. 
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For situations which require a short period of time to obtain the experi- 
mental results relative to the time required for the statistical analysis, (i.e., 
some laboratory experiments), it may prove less costly to use more replicates 
of a less efficient design with a simple analysis than to use a more efficient 
design with a more complex analysis. If the unit of observation is very difficult 
and costly to obtain (e.g., field experiments, animal feeding experiments, rota- 
tion experiments, some genetic experiments, etc.) and the cost of analysis is 
relatively minor, the experimenter would be well advised to control most 
sources of variation by all the means at his disposal, whether it be a more 
complex design or (and) the measurement of related variates. 

Reduction in the experimental error variance is often accomplished by 
measuring a related variate or variates and using a covariance analysis. If the 
cost of measuring the related variate is relatively cheap and if the correlation 
with the related variate is fairly high, the covariance technique may prove 
extremely useful. (For example, the initial weight of animals in a feeding ex- 
periment is easy to obtain and is related to rate of gain.) A good criterion for 
the research worker might be: if the variability of experimental material is not 
easily controllable, then measure it and use regression techniques to remove the 
initial variability. Often both control of variability and measurement of related 
variates are effective in reducing the experimental error variance. 


1-4 Validity and Choice of an Experimental Error 


The principles guiding the validity and choice of an error variance have 
been fully explained by Fisher [126, secs. 9, 10, 11, 65], but will be briefly 
recapitulated here for completeness. In order to have a valid error term for 
testing the differences among treatments, an experimental design of the 
randomized nature and replication of treatments is a necessity. Such a proce- 
dure allows for the calculation of unbiased estimates of treatment differences 
and an appropriate error variance. As stated by Fisher [126, sec. 65] the correct 
error variance for testing the variation among a set of treatment means is one 
which contains all the sources of variation inherent in the variation among treal- 
menl means except thal portion of the variance due specifically to the treatments 
themselves. 

These facts are evident from a consideration of the components of variance 
in the various mean squares in the analysis of variance. To illustrate the effect 


of randomization on the validity of an error variance, the following example 
has been prepared. 


жї Example 1-1. Suppose that fifty pigs are available for the experiment and that the 
initial weights {randomly selected samples from 273, table 3.1] for ten random-samples 
of five pigs each are: (30, 29, 39, 17, 12), (19, 42, 27, 25, 22), (16, 41, 37, 31, 25), (17, 30, 
24, 28, 35), (47, 33, 17, 33, 29), (17, 23, 31, 39, 30), (41, 26, 19, 32, 27), (20, 28, 39, 
43, 30), (38, 20, 30, 46, 36), and (42, 47, 41, 31, 29). The means for the ten samples 
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are 25.4, 27.0, 30.0, 26.8, 31.8, 28.0, 29.0, 32.0, 34.0, and 38.0, respectively. The 
analysis of variance is 


Source of variation df ss ms 
Among sample means 9 661.2 13.5 
Individuals within samples 40 3314.8 82.87 


Suppose that ten treatments are applied and that three of the treatments have a 
treatment effect equal to —6, three of the treatments have an effect equal to +6, and 
the remaining 4 treatments have zero effects. The ‘among sample means" mean square 
becomes 4[661.2 + 5(6)(36)] = 193.5. The resulting F = 193.5/82.87 = 2.33, which 
is approximately equal to the corresponding tabulated F value at the 3 per cent level. 
Such a result might lead one to reject the hypothesis of equality of sample means. 

On the other hand, let us disturb the random feature of the experiment and use a 
“balanced” grouping as used in some animal and educational experimentation. The 
individuals are rearranged in such a way that the means of all ten lots of five pigs are 
equal to the over-all mean, 30.2. Some such grouping as the following might result: (30, 
29, 39, 17, 36), (19, 42, 27, 25, 38), (16, 41, 37, 31, 26), (17, 30, 41, 28, 35), (47, 25,-17, 
33, 29), (17, 23, 31, 39, 41), (43, 30, 19, 32, 27), (20, 28, 39, 33, 31), (24, 20, 30, 47, 
30), and (42, 12, 22, 46, 29). 


For the above ten samples the analysis of variance is 


Source of variation df БЫ тоз 
Among sample means 9 0 0 
Individuals within samples 40 3976 99.4 


Now, adding the treatment effects previously obtained results in an F value — 


see = Sr — 1.21, which is considerably lower than F — 2.33 as found for 

random samples. If the treatment effects for the three treatments are increased to 
2 

— 8.34 and to +8.34 for the other three treatments, then F = d = 2.33. 


This requires an increase of 2.34, or 39 per cent in the size of the effects in order to 
detect significance at the 3 per cent level. In addition to the fact that larger effects 
are required for significance, the procedure is incorrect; fewer type I errors (rejection 
of null hypothesis when in fact it is true) and more type II errors (acceptance of null 
hypothesis when it is in fact false) are committed than should have been made when 
using the proper experimental procedure. Also, the within-sample mean square is 


larger than it should be. 
the disastrous effect of the above erroneous experimental 


Another way to observe ; ] 
procedure is to observe the additional US. per group that is required to 
nhs 


obtain an F value equal to 2.33, i.e., 2.33 = 994 — 
gs would have been required to detect the same differences among 


- Solving, п = 9.65 pigs. Roughly 


twice as many pi 
the sample means. 

The above computations have been carried out assuming a within-treatment 
correlation of unity between initial weight and final weight of the animal. If the corre- 
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lation between initial and final weights were zero, the above “balancing” or "equal- 
izing” of sample or lot means would be meaningless and would not affect the validity 
of the resulting statistical analyses. The true situation, however, is somewhere 
between the two extremes, and the experimental procedure of balancing is nol recom- 
mended when significance tests are later desired. The above balancing violates one 
of the basic conditions, i.e., randomization, necessary for obtaining a valid error term 
and hence invalidates the tests of significance. Yates [329] presents a discussion of 
the above procedure. He shows that the above procedure can be made valid through 
the use of covariance techniques for certain types of material. 


I-5 Symbolism 


As with all statistical manuscripts, the present one is subject to the criti- 
cism that the author did not use “standard” statistical symbolism. Since the 
last three words of the preceding sentence may imply something different to 
each writer in the statistical field, it is perhaps best to list and define the sym- 
bols used in the majority of places in the text. In general, Greek letters are 
used to denote the parameters in the population, and Latin letters or Greek 


letters with a hat (*) over them are used to denote the fact that these are 
sample estimates of the parameters. 


Хог Yi (i = 1,2, ---,n=a specified number) is the symbol for the ith 
observation. X; = first measurement, X; = second measurement, etc. Xi 


or Y; (i = 1,2, ---,n andj = 1,2, +++, т) is the jth measurement in the 
ith classification. 


Nim E the sample mean of n observations (Xy, Х»,...,Х„) = the 


sum of the observations, X, + Xat +--+ Xn divided ‘by the number л. 
y is obtained similarly. m is sometimes used to denote the sam 
т. denoting the sample mean of the X; and m, 
£is an unbiased estimate of the population mea 
age value of all X; in the population is д. 


ple mean, with 
the sample mean of the Y. 
n y, a parameter, if the aver- 


] A XG: 
пт Хч а the sample mean of nm observations. 
1 т 
me = ғ. = the sample mean over ally 
classification. Likewise, iso, = Ža = M 
n 


= n a 
(Xi — 2) = т, = deviation from the sample mean. 


У(Х; ET 2)? = Ya 
im i=l 


n 


32, --- m for the ith 


= sum of squares of deviations from the mean. 


= = 52 


= the sample variance (sometimes mean Square) of a single obser- 


wu. 
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vation and is an unbiased estimate of the population parameter о?. 


^ 
Date 


mi Е . „ B bel . 
ы = 5:2 = the estimated variance of a mean of n individuals and is ап 


n(n—1) 


unbiased estimate of the population parameter сг?. 
sd = sz = the standard error of a mean or the estimated standard deviation 
of means of n observations. 


sh S л 

4E + т = 6-6 = sj = the standard error of a difference between two 
1 2 

independently estimated means based on п, and n; observations, respec- 


tively. In the event that т = nz, {2 = 2s = 54. 

bi; = regression coefficient of the variate X; on the variate Хз; and is ап 
unbiased estimate of the population parameter fı. Other symbols will be 
used to designate regression coeflicients, but they will be explained when 
used. 

bin. = partial regression coefficient of Xi: on Xs; independent of Хз. 

ri = total or zero-order correlation coefficient of X;; with Xs; and is an esti- 
mate (though biased) of the population parameter, p. 

cv = coefficient of variation or variation coefficient. 

df = degrees of freedom. 

ss = sum of squares. 

ms = mean square. 

Ea, Es, Ec, Ea etc. = error mean squares. 

Eff = efficiency expressed as percentage. 


msi : же 
M ске Snedecor's F is used as a test of significance.* 
2 
i = "Student's" 1 is used as a test of significance." 


2 = chi-square is used as a test of significance.! 
= } log, F = Fisher's z is used as a test of significance.! 


x 

lanas = tabulated value of t at the с per cent level for n degrees of freedom 
associated with the standard error. 

F,(ny, nadf) = tabulated value of F at the a per cent point with n; degrees of 
freedom associated with the numerator and пз degrees of freedom associated 
with the denominator. 

xa2(ndf) = tabulated value of x” at the о per cent level for n degrees of freedom. 

< is less than. 


> is greater than. 


> is greater than or equal to. ; 
E[X] = average value over all possible samples of a given size or the expected 


value of the quantity X in the population. 
YThe statistics F, £, xê, and z are random variables with known distributions. Tables for 
these distribution functions have been prepared and are used in making tests of significance 


(see Chapter II). 


CHAPTER II 


Some Useful Statistical Tools and Concepts 


Statistical procedures and concepts are useful in the analysis of data and 
in the interpretation of the results from an experiment. А number of useful 
statistical tools—tests of significance for comparisons among a set of ranked 
means, transformations for experimental data, tests of significance for homo- 
geneity of variances, tests for additivity of data, nonparametric tests in the 
analysis of variance, tables of probability levels for the range and F, and a 
figure of probability levels of t and x^—are presented in the following sections. 


Additional methods are described and illustrated in statistics textbooks(75, 
87, 102, 127, 137, 179, 212, 213]. 


II-1 Tests of Significance for a Group of 
Ranked Means 


In experiments involving more than two treatments, the resulting F = 
y be larger than the corres- 
hosen level of significance, say 
then rejects the null hypothesis 
ding which means are different, 


merits of several of these tests 
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sociological reason for the comparisons.! In some problems, there is no mean- 
ingful reason for partitioning the n — 1 degrees of freedom associated with 
comparisons among n means. In such situations, we may, and quite often do, 
wish to know which means may be considered as different from the others. 
Formulated in terms of possible outcomes, this represents the total number 
of decisions one might make about a set of n means. For three treatments the 


possible decisions about the population means are 


(1) m = pa = Hs. 
(2) m < (us is not appreciably different from ys). 
(3) um «(un^ ^ Js p “ us). 
(4) m < (ш ^ © & ш). 
(5) (ш is not appreciably different from p2) < us. 
(6 (m “ “ g s “ ou) € ue 
Mite ^ "08 <a 
(8) ш < p2 < Bs 
(9) ш < ps < и. 

(10) po < ш < з. 

(11) uy < ps < pr 

(12) ws < i1 < uz 


(13) us < p2 <. 
(14) m < ра but из cannot be ranked relative to p Or из. 


“ “ 


(15) < u butina “ “ © «© py OF pe. 
(16) u-«ubutg ^" ^" ы * — * py OF ps. 
(17) u «but, * “ ^ * —— * p OF ps. 
(18) us < u buta] “ “ d * © pe OF ps. 

“ “ “ “ &« Be or из. 


(19) u < us but p 


Also, decisions of the form ш + us < 2us are possible. The number of other 
comparisons possible is determined by the nature of the experimental material. 
Fisher's z test or Snedecor's F test may or may not reject decision (1). If 
(1) is rejected, it may be because one ог more of the other possible decisions is 
the correct one, and the problem is to find which one or ones to accept. 
Although these tests involve a very large number of decisions, it is con- 
venient, from the point of view of understanding their properties, to regard 
them as simultaneous applications of a relatively small number of two- 
decision tests [96, 97]. For example, a multiple range test involving four means, 


comparisons between members of different 


1This does not necessarily imply that 
"i not the statistician, should determine whether 


subgroups are ruled out. The experimenter, 
or not such comparisons are meaningful. 


Й 
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say А, В, C, D, may be regarded as the joint application of two-decision tests 
of each of the following hypotheses: 


(4) all four means are equal, 

(ii) the three means, A, B, and C, are equal, 
(ш) “ а аА ВАСЕ Td а 
(iv) « “ “ A, C, “rp =í rA 
(у) “u а « Е В, C, а р, [1 “ . 
(vi) the two means, А and B, аге equal, 


Gd) the two means, C and D, are equal. 


Апу one of the multiple decision tests can be said to have a type I error 
(as defined in Chapter I) with respect to each of the homogeneity hypotheses. 
Tn testing the four means, there is a type I error corresponding to each of the 
eleven different homogeneity tests of which the over-all test is comprised. To 
distinguish between these type І errors, it is convenient to refer to 


(i) the error of wrongly rejecting the h 
as a four-treatment type I error, 

(ii) the error of wrongly rejecting the hypothesis that any three means are homo- 
geneous as a three-treatment type I error, and 


(iii) the error of wrongly rejecting the hypothesis that, any two means are 
geneous as a two-trealment type I error. 


ypothesis that all means are homogeneous 


homo- 


In general, then, the n-treatment {уре I error refers to the error of wrongly 
rejecting the hypothesis that n treatments are homogencous [96, 97] 
In the following sections the ager 


ment, --+, and v-treatment type I er 
The quotes are used to distinguish 
two decisions, 

The discussion on the kinds of errors со 
the present text. For a full 
is referred to the works of 


egate of the two-treatment, three-treat- 
rors is referred to as the “type I error." 
this error from the one involving only 


mmitted is necessarily limited in 
er understanding of the various errors the reader 
Duncan [96, 97] and Tukey [296]. 

I-1.3 MULTIPLE RANGE TESTS 


II-1.1.1 The lsd test or the 
cedure fo 


» f, associated wit 
bolically the lsd is Ss Ine sg with the standard error of the mean. Sym- 


dlos gay. All differences of means are compared. 


OL Le 
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with the lsd, and if the difference exceeds the lsd, the means are said to come 
from populations with different means. The 1 per cent value of ¢ is sometimes 
used instead of és. The value of the quantity 52/2 бо уау is called the msd, or 
the most significant. difference. Occasionally the lsd is called the minimum 
significant difference. A further variation of the multiple ¢ test at the o per 
cent level is the use of an F or z test first, and if this shows significance, then 
a multiple ( test is performed on the differences among the treatment means. 
This test is discussed by Fisher [126, sec. 24] and in section I1-1.2.1. 

In connection with these tests, Fisher [126, sec. 24] bas stated that the 
v — 1 degrees of freedom among a set of v treatments may be partitioned into 
v — 1 orthogonal comparisons, each with a single degree of freedom, and that 
it is possible to obtain an estimate of the error variance for each single degree 
of freedom comparison in a number of the experimental designs. The difficulty 
here is that the individual error variances usually are associated with a small 
number of degrees of freedom; because of this, the pooled error variance with 
a larger number of degrees of freedom is used whenever possible. Also, more 
than the v — 1 orthogonal comparisons are usually desired. 

The lsd test is appropriate if the comparison is selected prior to conducting 
the experiment. However, if the comparison is selected after the treatment 
means are observed, a certain number of the differences will be large, owing to 
sampling variation. The most extreme case is the comparison of the largest 
mean with the smallest mean in a set of v treatment means. In this case, 
rtley [243, 244] have shown that the v treatment type I error 
2 means but is some larger value. The size of a v-treat- 
1 to 1 — ;P,(Q), where ;P.(Q) is obtained from the 


Pearson and Ha 
is not 5 per cent for v > 
ment type I error is equa 
following formula: 


,PAQ) = Pa(Q) + 40) + КҮЛ шл) 


where P,(Q), an(Q), and 6,(Q) are obtained from table 1 in Pearson and 
Hartley’s paper [244], n = = number of treatments in the group, f = de- 
grees of freedom associated with the error variance, and Q = 4/2 lagar 

For more thah two treatments in a group the size of the v-treatment type 
I error associated with the comparison of the largest mean with the smallest 
mean is larger than 5 per cent. For f = 40 and for losr/2 = 2.86 the ?-treat- 
meni type І error associated with the comparison of the largest with the 
smallest mean is approximately equal to 27 per cent for five treatments, 59 
per cent for ten treatments, and 86 per cent for twenty treatments. Thus, the 
Isd procedure for comparing the ae and жы means should not be used 

i ro t ents in the experiment. 

rM N test. In 1927, Student: [283] sug- 
gested that use be made of the range as a procedure for rejecting divergent 


M C9 N 
samples in a series of routine analyses. Newman [231 msde = of шү 1 


| ШКЕ Р ett DÀ 
m € TT ы V 
тк ELE UNO Т i 1 M 


Sx 


ts of the studentized range qa = Xmaximum ^ Xminimum 


їп 


TABLE 11-1. Upper percentage po 


а = .05 


+01 


а = 


1952. Ап improved version of this table may be found in table 


‘studentized’ range," Biometrika 39: 192-193, 
, Cambridge University Press, 1954, by E. S. Pearson and H. O. Hartley. 


pper percentage points of the 


*This table was reproduced with the permission of the Biometrika Trustees from the paper by May, J. M., "Extended and corrected tables of 
29 of Biometrika tables for statisticians, Vol. 1, 


the uj 
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1231, 283] ideas for subdividing а group of ranked treatment means into 
subgroups asserted to be not heterogeneous and presented tables for a number 
of percentage points of q for 

= Imax — Emin. range 


Шет m = standard deviation' 


where a — the percentage level of q for n treatments, Za. = the largest 


mean in the group, 2ш. = the smallest mean in the group, and ss = —— is an 


m 
independent estimate (such as is obtained from a randomized design) of the 
standard error for each mean. The means must be obtained from the same 
number of observations, Say r, and all standard errors must be equal. 

Newman's table was computed by quadrature from E. S. Pearson's approx- 
imate probability law of the Studentized range. Later, Pearson and Hartley 
[244] computed a table similar to the one obtained by Newman. The tables 
were computed by an approximate method. May [208] recomputed their table 

[244] in order to sbtain results correct to more significant figures. This table is 
reproduced in table 1-1. The last lines (= df) in the two parts of table 1-1 
correspond to the values given in table 5.5 of Snedecor's book [273] and to 
the values used by Student [283]. The first column (n — 2) may be obtained 
by multiplying the ta values for the corresponding degrees of freedom by у. 
The table may be extended for other values of a and degrees of freedom in the 
manner described by May [208]. It is desirable to have not less than ten de- 
grees of freedom, and preferably more, associated with the independent esti- 
mate of the standard error. Therefore, the part of May's table relating to 

degrees of freedom less than 10 was omitted. 

In applying the Student-Newman-Keuls multiple-range test (so called be- 


cause each contributed to its formation [180, 231, 283]), the following steps 
are taken: 


Step (I). Subdivide the 
sociological groups. Natural 
particular set of treatments 
means from two or more nat 

Step (II). Choose a sign 
per cent level. 

Step (III). Compute the standard error of a treatmen 
values W, — da.n52, Wat = qu 182, + 
tasas V2 sz = lsd. Rank the treatmen 


Eas Ent, ++, д, 

Step (IV). Compare the ran 
lated W,. If z, — 7, is less 
asserted to belong to a non-heterogeneous group. If z, — 7, > W, subdivide the 


to # and 7,1 to Z3, and state 

т 5 , n-1 1 

that Tn 18 different from . ranges 2, — Xo ёна Tnr1 — Xi 
п n. 


treatment means into biological, physical, or 
groupings as prescribed by the choice of the 
have meaning; it is doubtful if a ranked set of 
ural groupings has any practical significance, 
ificance level, о, which usually will be the 5 or 1 


t mean, sz, and the 
› Wa = quas; and We = geass = 
t means from highest to lowest, 
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with W,,.1. If either range is less than У/„_у the means in the group are said 
to belong to a single group. If either range exceeds W,..; the n — 1 means аге 
divided into two groups of n — 2 means each and compared with W,. s. The 
process continues until a subset of means is obtained which does not exceed 
the calculated value W;. The process stops whenever the actual range of a 
subset is less than the calculated range. No subset of means is compared if the 
subset is included in a larger subset which is less than the calculated range W;. 

The above procedure has been stated in more concise terms by Duncan 
[97] as follows: “The difference between any two means in a set of n means is 
significant provided the range of each and every subset which contains the 
given means is significant according to an a-level range test." 

Care should be employed in making comparisons within a subset of means 
after some observations have been rejected [231]. It is possible to obtain a 
significant F value on a subset of means whose range is less than the computed 
W;. The Student-Newman-Keuls multiple range test compares the range of 
means in the subset with the calculated range at the a per cent level. The F 
test compares the variance for the subset means with the error variance. Also 
the error rate per experiment is different for the two procedures [96, 97, 296]. 
One or more of the single degree of freedom contrasts from the subset of means 
could give a relatively large F value, whereas the observed range might be less 
than the calculated range. Ап example of this situation is encountered when 
the means in the subset fall into two groups. However, in this case there prob- 
ably are two natural groupings, and this should be taken into account before 
beginning the test. If the grouping in the subset of means is not known, then 
one of the multiple comparisons tests described in ine next section may be 
used to determine which contrast among the means accounts for the signifi- 


cance of the variance ratio test. 


Example 11-1. Snedecor [273, table 10.3] presents the data from an experiment on 
grams of fat absorbed by twenty-four doughnuts for eight different fats. The eight 
means are presented in table 11-2. The independently estimated standard error of a 


mean is equal to sz = 4/141.6/6 = 4.86, with 40 degrees of freedom. 
Step (I). No information is given concerning the relationship among the various 
f the fats may be vegetable fats, others may be animal fats, and the 


fats. Some o А а 
ed. However, with no further information the 


method of processing may have vari 
fats are considered to be a single natural group. 

Step (II). The a = 5 per cent level of significance is chosen. 

Step (III). From table II-1, the values of gos,n8z = calculated range for a homo- 


geneous population = W, for n = 8, 7, 6, 5, 4, 3, and 2 and for f — 40 degrees of 


freedom are 


Waa = 4.52 (4.86) = 22.0, 
Waar = 4.39 (4.86) = 21.3, 
Was = 4.23 (4.86) = 20.6, 
Was = 4.04 (4.86) = 19.6, 
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Wars = 3.79 (4.86) = 18.4, 
Wares = 3.44 (4.86) = 16.7, and 
Wr-2 = 2.86 (4.86) = 13.9 = lsd. 


TABLE II-2. Differences of fat means (grams of fat absorbed 
in 24 doughnuts) 


The ranked means and all possible differences among the eight means are presented in 
table 11-2. Only a part of the table may actually be computed in practice. 

Step (IV). The range of the largest and smallest means exceeds the calculated 
range; thus: 


Z,— 2; = 185 — 161 = 24 > Wg = 22.0. 


Therefore, we compare the ranges for the two subsets of seven means with the cal- 
culated range; thus: 

24 — їв = 185 — 162 = 23 > W,_; = 21.3. 

їз — Фр = 182 — 161 = 21 < W,.., = 21.3. 


Only the first subset of seven means exceeds the calculated range. Therefore, we sub- 
divide the seven means into two subsets of six means each and compare their range 
with the calculated range. The subset of six means including z; and 28 is included in 
the larger group 23 to Z;, which has already been declared nonsignificant. The range 
for the subset of means for fat numbers 4, 3, 2, 6, 1, 5 is compared with the calculated 
range; thus: 


Z4 — 5% = 185 — 165 = 20 < Wars = 20.6. 


The process terminates, since the range is less than the calculated range. 

From the above test procedure the experimenter would conclude that significantly 
more fat is absorbed for fat number 4 than for fat numbers 7, 8, and probably 5. Like- 
wise, fat number 3 is absorbed to a higher degree than are fat numbers 7 and probably 8. 


proposed test is better than 


previously proposed tests for comparing differences between means. The test 
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allows the experimenter to commit fewer "type II errors" and more “уре I 
errors" than the test discussed in the preceding section. 

The test procedure is essentially the same as for the Student-Newman- 
Keuls test except that the values in table 11-3 are used instead of those in table 
II-1. The various steps in the test are illustrated in the following example. 


Example 11-9. The data from example II-1 and table II-2 are used to illustrate 
the calculations for Duncan's multiple range test. 

Step (I). There appear to be no natural groupings. 

Step (II). For the present example, choose the significance level œ = 5 per cent. 

Step (III). The standard error of the mean is sz = 4/141.6/6 = 4.86, with 40 de- 
grees of freedom. The calculated ranges (Duncan denotes these as least significant 
ranges) are computed as follows: 


Daas = 3.30 (4.86) = 16.0, 
D,.; = 3.27 (4.86) = 15.9, 
Dus = 3.22 (4.86) = 15.6, 
Dus = 3.17 (4.86) = 15.4, 
Da-i = 3.10 (4.86) = 15.1, 
D,.; = 3.01 (4.86) = 14.6, and 
D,» = 2.86 (4.86) = 13.9. 


The values 3.30, 3.27, ---, 2.86 are obtained from Duncan's tables [96] or by inter- 
polation from table II-3. ч 

Step (IV). The ranked means and the differences between the means are given in 
table II-2. The range of the largest and smallest means is compared with the calculated 
range; thus: 

Фа — £;— 185 — 161 = 24 > D,.s = 16.0. 

'Then, 
185 — 162 = 23 > D, = 15.9 and 
182 — 161 = 21 > D,-; = 15.9. 


Z4 — їв 
їз — 2; 


yon 


The next step is to compare subsets of six means with the calculated ranges; thus: 


д — у = 185 — 165 = 20> Drews = 15.6. 
| їз —– 38 = 182 — 162 = 20> Р = 15.6. 

d, — їв = see above. 
| Z — Z = 178 — 161 = 17 > D; = 15.6. 


The next step is to compare subsets of five means with the calculated ranges; thus: 
= = 15.4. 
Ф. — ā = 185 — 172 = 13 < Das 
i 5 — 25 = 182 — 165 = 17 > D» = 15.4. 
їз — Zs = see above. 
| fa — # = 178 = 162 = 16 > Dr-s = 15.4. 
i 22 — 2з = 806 


above. 
i — i7 = 116 — 161 = 15 < Р,-5 = 15.4. 


TABLE 11-3. Significant ranges for a 5% level new? multiple range test* 


Significant ranges for a 1% level new? multiple range test 


Biometrics 11:1-42, 1955. 


d with the permission of the editor of Biometrics from the paper by D. B. Duncan 


"Using special protection levels based on degrees of freedom. 


*This table was reproduce 
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Since the means for fat numbers 3, 2, 6, and 1 are included in the larger subset 43261, 
which was declared nonsignificant, the range of means in the subset 3261 is not com- 
pared with D,.4; only the range of means in subset 2615 from the subset 32615 is 
compared with D,4; thus: 


£4 — 25 = 178 — 165 = 13 < D, = 15.1. 


In the next subset of means (26158) greater than the calculated range the two subsets 
of four means each are 2615 and 6158. Both subsets have already been compared with 
the calculated range in larger subsets. "Therefore, the process terminates. 

The experimenter would conclude that the amount of fat absorbed with fat numbers 
4 and 3 is significantly higher than for fat numbers 5, 8, and 7, and the amount of fat 
absorbed with fat number 2 is significantly higher than for fat numbers 7 and 8. The 
remainder of the fats are said not to differ with respect to the amount of fat absorbed. 
The results may be pictured graphically as in figure II-1. 


lsd test 


ST 
a Р за серен тҮ 


Student-Newman-Keuls range test 


Oe Se Т LL СО\ 
CT a Se Ae RIEN 


Duncan's multiple range test 
p 
ЕБЕ A ҮЗ Т К NN 
Tukey's hsd test 
Д ————— 
161 162 165 172 176 178 182 185 
(7) (8) (5) (1) (6) (2) (5) (4) 


Figure II-l. Giaphical array of grams of fat absorbed for the eight fats in example 
1-1. The above bracketing is possible at the conclusion of various multiple range tests. 
The means within a bracket are asserted to be not heterogeneous, and means not bracketed 
together are asserted to be different. 


JI-1.1.4 Tukey’s test based on allowances. А multiple range test 
similar in application to the lsd test has been proposed by Tukey [295, 296]. 
He discusses several variations of the test, but only one of the procedures— 
the hsd (honestly significant difference) test, or the w-procedure—has been 
selected to illustrate his method. Using the hsd test and testing at the a per 
cent level, the number of experiments in which misstatements are made is « 
per cent out of all experiments. The v-treatment type I error is set at a per 
cent, and the two-treatment, the three-treatment, • • •, (v — 1)-treatment 
type I errors are less than a per cent. In Tukey's other tests the percentage of 
error may be set per line in the analysis of variance, per set of experiments, 
etc. [296]. i 

The value for the hsd is equal to the value of W,- , obtained in the Student- 
Newman-Keuls multiple range test. If two means (or groups of means) differ 
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by more than the hsd, they are said to differ. Also, the hsd may be used to set a 
confidence interval on the difference between any two means. Tukey [296] 
says that it may be more important to state that the true difference between 
two means lies in a specified interval than to state tha 
of means) differ significantly. A confidence statemen 
nificance statement, but the converse is not true. 


t the means (or groups 
t always contains a sig- 


Example I1-3. The procedure of applying Tukey's hsd test is illustrated with the 
data from example II-1. Steps I and II of the hsd test are the same as steps I and II 
of the Student-Newman-Keuls test. The third Step is to compute the hsd value, which 
is equal to 


Үл» = Фаз = 4.52 (4.86) = 22.0 = hsd. 


The difference between any two treatment means 


(table II-2) is compared with 
the hsd. If the difference exceeds this value, 


the two means are asserted to come from 
ferences, 24 — žē; = 185 — 161 = 24 and 
23, exceed the Аза. Therefore, it is stated that fat number 4 
degree than are fat numbers 7 and 8. Also, the true difference 


confidence intervals, 


II- 
hsd ќе 
than 


to illustrate the Link d 
.F E and Wallace 
Sel n s rom the original data [273, table 10.3 


ERE acp T TES 
ange й 39 36 298 20 33 30 41 2 248 
Mean 172 178 182 185 165 176 161 162 siste 

Instead of computing 


the hsd as in exam. 
and of the tabulated val 


: ple IT-3, use is made of the sum of the ranges 
ues in table hs which is a condensed version of a larger 
) : nt, the number in the group is 6, and th b 

of ranges is 8. The corresponding figure from table IT-4 is 0.55. The value of 248(0.55)/ 6 


TABLE II-4. Factors for allowances for one-way classifications* 


К) Number of Groups = Number of Ranges (v) 
no./group 
= no./range 


2 5.55 2.37 1.78 1.40 1.16 1.00  .8T 18 ло .66 .63 .58 .50 ат bh е о .38 .56 
Н 1.91 1.44 1.15 9% ВО ТО, E 56 „51 АТ à м 36 .55 32. 50 229 127 

1.65 1.25 1.01 8h .T2 «65 -91 91 т 05 AO 5T 2 435. 451 29 287 .eT 05 

5 1.55 1.19 .96 .81 ло .61 .55 50 45 2 .39 . 5 152 30) .29 27 .26 .25 

6 1.50 1.18 -95 s 2691 OL) + 8 A9 45 ә .39 ЖЮ 50 29 aT 26 .25 

T 1.49 1.27 .95 .80 .69 .61 .55 .50 5 bo .39 .36 .» . .0 29 28 .26 .25 

8 1.39: 1.17 .96 .81 ло  . .55 29 .6 ‚р 59 .7 .5 .535 34 29 .28 27 25 

9 1.50 1.18 .97 Т1 2 д 5 т 4d ЭП 0.35 55 .320 .30 06 27 26 

10 1.52 i20 .98 .85 12 65 T 52 т AO и. 939 R Je „0 .29 27 26 

1$ risk 

2 7.92 ә 2.96 2.06 1.69 1.39 1.20 1.05 gı .2 .75 .68 .65 2 „55 .51 ..58 46 43 

Ё 3.14 2.14 1.57 1.25 1.04 89 4 69 462°  .51 - AB A5 a 39 37 -5 à 2 

2.47 1.7% 1.55 1.08 91 -18 69 .G .56 591 GFT. .M o 5. " JG .51 29 

5 2.24 1.60 1.24 1.02 86 75 “ 59 .5h 9 u Jio. О 157 .5 255 .31 30 29 

$ 2.14 1.55 1.21 .99 .85 T 65 .59 .53 49 45 .he 2 ST 35. 533. 431.50. 128 

1 2.10 1.55 1.21 .99 8 ть 65  .59 .53 9 45 „Мә . 37 «55.35 52 30 29 

8 2.08 1.52 1.21 .99 .85 Л E 59  .5h .50 86 43 .ho 57 35 55 0 30 .29 

9 2.09 1.55 1.22 1.00 .85 15 .66 A 54 .0 6 .5 о .58 Sh. 1 229 

10 2.10 1.55 1.23 1.01 .86 75 61 61, .55 51 #1 .h go. 5 5 .55 .31 .50 


*This table was reproduced with the permission of T. F.. Kurtz, R. F. Link, D. L. Wallace, and J. W. Tukey, Princeton University, 
Princeton, New Jersey. . 
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= 22.7 closely approximates the hsd value = 22.0 obtained in the previous example. 
From here on the test procedure is exactly the same as for the hsd test. 


П-1.1.6 Comments on multiple range tests. The use of the lsd test 
leads to v-treatment type I errors larger than a = 5 per cent when the largest 
mean is compared with the smallest mean in a set of v treatments for v > 2. 
The size of the type I error is obtained from formula 11-1. In order to remedy 
this defect in the lsd test, other tests have been proposed. Some of these are 
described in the preceding sections. The choice among the three tesis given 
in sections 11-1.1.2 to II-1.1.4 will depend upon the size of the “type I error” 
desired. Duncan’s test has a larger “type I error" but a smaller “type П 
error" than either of the other two. Likewise, the Student-Newman-K euls 
test has a larger “type I error” and a smaller "type II error" than Tukey's 
test based on allowances. The choice between the tests will depend upon the 
relative importance of the errors. Discussion of the relative merits of the 
various procedures is limited to that given in unpublished reports [96, 97, 296]. 
The effect, of non-normality and heterogeneity of error variances upon the 
test has not been fully discussed [296]; despite this the tests described in 
sections II-1.1.2 to II-1.1.5 are considerably better than the lsd test and 
should be used in preference to it. The lsd.test, does not take into account the 
number of treatments in the experiment, whereas the other tests do. 

If the [sd test is used, the following procedure tends to decrease the size of 
the “type I error” and to retain the simplicity attached to the lsd and hsd tests. 
Among the n means, there are n(n — 1)/2 possible differences, The lsd test on 
all differences gives approximately the right two- 


1 treatment type I error. The 
trouble arises when only the largest differences are compared with the lsd. 


alone. Therefore, if the (0.05) (1) (n — 1)/2 smal 
lsd are asserted to igni 


П. MULTIPLE F TESTS 
II-1.2, 


Fisher’ AS y ! 
24] suggests the combins] a eant difference test, Fisher [126, sec. 


- If the observed 


"^ 
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for two and three treatments in the experiment. With more than three means 
in the group the “type I error” is larger than a, and in some cases it is as large 
as that obtained for the lsd test (section 11-1.1.1). Hence, the procedure cannot 
be regarded as satisfactory. 

The F test may be used with any multiple range test. The properties of 
such a procedure have not been discussed. Further study is required to deter- 


mine whether such procedures are worth while. 


11-1.2.2 Duncan's multiple comparisons test. In addition to com- 
parisons between any two means, comparisons between groups of means are 
often desired. Duncan [93-7] has developed a test procedure for such compari- 
sons. His multiple comparisons test resolves the difficulty of obtaining a non- 
significant range and a significant F value. The test procedure involves three 
stages. The first stage is essentially a multiple range test but with different 
ranges than any of those given in the previous sections, although the calcu- 
lated ranges are not too divergent from those obtained for the Student- 
Newman-Keuls test. The calculated ranges for the multiple comparisons test 
are obtained from tables II-5 or 11-6. 

The second stage of Duncan's multiple comparisons test involves the cal- 
culation of a set of least significant sums of squares and of the sums of squares 
among certain combinations of the means. The sums of squares among means 
are compared with the least significant sums of squares which are obtained 
from the least significant ranges. This comparison is essentially a comparison 
of observed F values with tabular F values, although sums of squares are used 
instead of mean squares and the tabular values in tables IJ-5 and II-6 are 
different from those used for ordinary F tests [273]. 

The third stage of Duncan's test is required only in certain instances. The 
observed range for the means in the group must be less than the calculated 
range for two means, and the observed sum of squares among means must be 
larger than the calculated sum of squares for the number in the group. This 
stage of the test is useful in detecting the significance of a comparison of 
groups of means. The various stages in Duncan's multiple comparisons test 


are illustrated with a numerical example. 


e II-2 and the estimated standard error of а 
eedom, are used to illustrate the procedure for 
ns test. The first step is to construct table 11-7, 
er table 11-5 or table 11-6, depending upon the 
with the preceding examples the 5 per cent 
level of significance is chosen. In table II-7 the first row contains the numbers 2, 3, ---, 
» = 8, the number of treatments in the group. The second row contains the values of 
R’, for 40 degrees of freedom obtained from table 1-5. The third row is computed by 
multiplying sz by the tabulated value of R’p.40 for р = 2, 3, +++, 8; for example, 
Rs = R'ass = 3.143(4.86) = 15.27. The fourth row 1s computed from the formula 
Ssp = 148,2; thus, ss; = МВ = 116.59. The symbol ssp indicates a sum of squares 


Example 11-5. The means in tabl 
mean, s; = 4.86 with 40 degrees of fr 
applying Duncan's multiple compariso 
using s; and values of R’p.40 from eith 
significance level chosen. To be consisten 


TABLE 11-5. Significant ranges, R^,,ar, for a 5% level multiple comparisons test* 


8 


3.871 
3.840 4 


346 5.595 3.814 


324 5.572 


5.792 


5.655 


Ah 5.691 
373 5.621 


3.783 


5.390 3.611 3.814 
3.558 5.580 


„ “A signifi- 


”” Va. J. Sci. 2:171-189, 1951. 


*This table was reproduced with the permission of the editor of the Virginia Journal of Science from the paper by Duncan, D. B 


cance test for differences between ranked treatments in an analysis of variance, 


f 
| 
| 


for a 1% level multiple comparisons test* 


pdf» 


R 


6. Significant ranges, 


TABLE II 


*This table was reproduced with the permission of the editor of the Virginia Journal of Science from the paper by Duncan, D. B., “A si, 
) D. D., signifi- 


cance test for differences between ranked treatments in an analysis of variance,” Va. J. Sci. 2:171-189, 1951. 
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significant at the œ = 5 per cent level. It is not necessary to compute the values of 
зэр until stage 2 has been reached. By this time, it may be unnecessary to compute all 
355, since the bracketed groups may contain fewer treatments than the larger values of p. 


TABLE II-7. Least significant ranges and least significant sums of squares 
caeca peo Тт 
Бас 


STAGE 1. А RAPID FIRST APPROXIMATION 


Tn the first stage, observe that, the difference between the largest and smallest 


means, i.e.,  — z; = 185 — 16] = 24, exceeds the value of Rs = 20.31. Since it 
does, the means of fats 4 and 7 are considered to be significantly different. Repeating 
the process, compare fat 4 with the other fats: 


74 — 28 = 185 — 162 = 23 > R, = 19.44, hence fats 4 and 8 differ. 
Z,— % = 185 — 165 = 20 > Re = 18.54, hence fats 4 and 5 differ. 
% — їр = 185 — 172 = 13 < Rs = 17.55, and the process terminates, 


% — č = 182 — 161 = 21> R, = 19.44, hence fats 3 and 7 differ, 
2з — Zs = 182 — 162 = 20> Вв = 18.54, hence fats 3 and 8 differ. 
ža — 24, = 182 — 165 = 17 < Ry = 17.55, and the Process terminates, 


4, and no further 
until the next Stage. Care should be taken i 


161162 165 З и ста 


(7) (8) (5) е le 178 182 185 
ORAT a) (6) (2) (3) (4) 
е 11-2. Gr i 9 
number of the fat E шар атау of eight fat means о 


parentheses, The above bu Pons II: 


Duncan's test, ing is possible 


1 with the identification 
at the end of stage 1 of 
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STAGE 2. TEST FOR RANGES AND SUMS OF SOUARES OF GROUPS 
IN BRACKETS 


In this step, repeated applications of the following rule are made: For a group of 
means bracketed together, say 2s, s, +++, Zn ranked in order, the difference Ën — 3i is sig- 
nificant if Zn — & > Re and if also the sum of squares of each combination of means, 
chosen from iy, Za, +++, Zn and including ®у and žm, exceeds the least significant sum of 
squares ssp, for p means, p being the number of means in the given combination. 

The first step is to compare the sum of squares among the n means in the group 
with ssn. The second step is to compare the range £, — #1 with Rz and with Rs, etc., 
until Rng > En — 1 > Rn for Ra > Ën — d. From this fact, it would be established 
194, 95] that the sum of squares among m means or less would exceed the corresponding 
585. If there are several sets of sums of squares among m varieties to compute, select 
the group giving the smallest sum of squares, and if this exceeds ssp, the remainder 
will also. 

Na Nagara [230] developed a procedure for determining which combination of 
means including the extremes yields the minimum sum of squares. Given the set of 
ranked means £j ds +++, £4 where 2, is the largest mean, ihe mean of the means 
z= D;/n, and the deviations Zj — 2 = yn Za — € = у ++, 2» — $ = ya the 
combination of n — 1 means yielding the minimum sum of squares will be the one 
excluding either Z2 or 2,3. If ГА! > Yn-ı, £2 is excluded, and if [эз] «yan Bn 18 


excluded. The minimum sum of squares is equal to (6 — 2): — yè — уё/(п– 1), 


where i is the value excluded, either 2 or n — 1. 
The minimum sum of squares among combinations of n — 2 means from a set of 
n means, including the two extremes, is obtained by the following procedure: 


(i) If 2y2 > (n — 2+ 1)(—Yn-1 — Уз), exclude zs and 2;. 
(i) If —2yn-1 > (n — 2 + D(ys + ¥n—2), exclude Zn—2 and n-i. 
(iii) If the inequalities in (i) and (ii) are reversed, exclude zs and Ў, 1. 
(iv) The minimum ss is equal to De —ap—yà8—»y?-—(-c yn - 2), 
where i and j refer to the pair of values excluded in steps (i) to (iii). 
Na Nagara [230] presents а procedure for obtaining the minimum sum of squares 


s of n — 3 means from a set of n means when the extremes are 


among combination А A 
additional steps. The results are illustrated 


retained and indicates the procedure for 
below. 
Applying stage 2 to the group o 
6, and 1, we find 
а) эш = 185? + 1822 + 178 + 176 + 172 
(185 + 182 + 178 + 176 + 172)° 
E^ 5 


f five means containing the means for fats 4, 3, 2, 


= 103.2 < 355 = 154.00. 


Hence, there are no significant differences in the group 43261. 


Considering now the group 32615, 
a) = 167.2 > 88% 
(2) 8832615 idt 17 > Ry = 16.48, 


23 = ТЕ = 

H 583, 

an = 135.80; ss325 > 592, 5565 > 883, 
d, therefore, 883015 > $54, 883265 > 55% p 


553615 > S94 
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and ssa; > ss; = 116.59. From (1) and (2), all relevant inequalities are satisfied; hence, 
fats 3 and 5 are considered to be different. In the above application, note that combi- 
nations of three and four means involving only the means for fats 3 and 5 are relevant 
and that the combination 555; is not required. 

Continue this process with the remaining groups of four from the above group of 


five, i.e., groups 3261 and 2615. The former group has already been included in the 
larger group 43261 and need not be tested h 


ere. The latter group will be tested in the 
larger group 261587. 
We are now ready to consider the last 


group of means, i.e., the six means for fat 
numbers 7, 8, 5, 1, 6, and 2. 


(1) 55:85 = 161* + 162? + 165° + 1722 + 1762 + 178° — (1014)?/6 


= 268 > ss, = 171.87. 
(2) 22 — % = 178 — 161 = 17 > Ry. 


The minimum sum of squares amon; 
the procedure described by Na Na; 
176 + 172 + 165 + 162 + 161)/6 
— 169 = 7. Hence, 


obtained excluding the mean 162 and is equal to ss78512 
= 209.2 > зз = 


S Yai = y; = 176 
is equivalent to that 


puted, since the range 2; — 2; > Ry. Hence, 
the means for fats 7 and 2 are asserted to be different. 


consider now grou 
both 7 and 2, or 78516 and 85162. Th 
Ze — #1 = 176 — 161 = 15 < В, 
now proceed to the stage 2 analy 


sis for each of the two sets of five means, since the 
ranges are less than the compute 


d ranges for Rs. For the first group, 
(1) 
(2) 


3318616 = 170.8 > ss; = 154.00 and 


26 — 2; = 15 > R: = 13.89. 


Since the range is only greater than R; and not Rz or Ry, 
of squares for various $858. The mini: 


we need to compute the sums 
8 among four means is the sum 
yi — y2/4 = 110.8 — (162 — 
Pie among combinations of 
t i and 8 or 1 and 8 is excluded and 
1з equal to 170.8 — (—5.2)# — (~2.9)2 [=] Ay/3 = 
e NA (=2.2)? — (71.4773 = 120.67 = 170.8 — (—5.2)? — 
M АП inequalities are satisfi 
e group of five means is b ki i i 5 
fee ML E БО en up into two groups of four; le., 7851 and 8516. The 
7851 is 

55-12-1601 < Rn, 
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The stage 2 test is 
(1) s$msi = 74 < ss, = 135.80. 


Ins. 


Hence, no differences are found among these four mea 
de. From the stage 2 analysis, 


Comparisons within the group 85162 are now ma 


we find 
(1) sses102 = 191.2 > sss = 154.00. 
(2) Za — ig = 16 > Rs, and the inequalities for three means 


$5952, $8912, $962 > 555 are satisfied. It is now required to check the sum of squares in- 
volving four means. The minimum sum of squares is 


$381: = 152 > ss, = 135.80, 


and all inequalities are satisfied. A difference between means 2 and 8 is indicated. The 
groups of four means to be considered now are 8516 and 5162. The groups are not 


changed by a stage 1 analysis: 


Ze — ig = 176 — 162 = 14 < В, 
апа d$ — 25 = 178 — 165 = 13 < Ry. 


The stage 2 analysis оп the group 8516 results in the following: 
(1) 53516 = 122.75 < ss, = 135.80, 


and the test terminates. No significant differences are included in the group 8516. 
We now consider the group 5162. 


(1) зѕывз = 98.75 < ss, = 135.80, 


and the test terminates. 


All groups have been separated into subgroups asserted to be not heterogeneous 
in the above analyses, and no further analysis is required for these data. Brackets 
include groups of means which do not contain significant differences (see figure II-3). 


From this it is concluded that 


7 is different from 2, 3, 4, and 6; 
8 is different from 2, 3, and 4; 
5 is different from 3 and 4; 


in the remainder of the comparisons the treatments are not different. 
2 "S f n means is greater than ss, = 14Rn° but the 


If oup 0: 

range i a ine ШИ ied it s necessary to conduct ү EY Ишан 
multiple co i . The stage 3 analysis consists of partitioning the п — 
ges of Ger dg p i of freedom contrasts. The бара ог шс 
Parisons contributing to the significant sum of squares are segregated in р е кы 
analysis. Duncan [94] suggests that inspection of the meas usually in pus e 
linear combination(s) of the means causing the significance. In the above examp! 7 ш 
Stage 3 analysis was required. If a stage 3 analysis had been required, a statement abou! 
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the comparison of means contributing to the significance of the sums of squares would 
be added to the bracketing in figure 11-3. 


i EECARSS У 
(M ау 
A сыз (| 


a ———є 
o — ————————_ 
161162 165 172 176 178 182 185 
(7) (8) (5) (1) (6) (2). (3) (4) 
Figure 11-3. Graphical array of eight fat means of example 11-1 with the identifica- 
tion number of the fat in parentheses. The bracketing is possible at the end of Duncan's 


test. The means within a bracket are said not to differ, but means not bracketed together 
are asserted to be different. 


In some cases the range of the n means in the group may be greater than Rz and 
the sum of squares greater than ss», but one or more of the sums of squares involving 
fewer means may be less than the 355. For such cases a stage 3 analysis is necessary. 


I-1.2.3 Scheffe's test. Scheffé [262] described а procedure for testing 
comparisons within a set of v means. The comparison may be decided upon 
prior to conducting the experiment or after the results have been obtained and 
studied. The test procedure is similar in application to Tukey’s test based on 
allowances. Instead of obtaining the multiplier for the standard error of a con- 
trast from a table of ranges (table 11-1) the multiplier is obtained from the 


ordinary F table (table 11-8) and is equal to 4/(v — DF. — Т, Јар = 8, 
where v = number of means in the experiment, о = level of significance, and 


f = the number of degrees of freedom associated with the error variance. The 


standard error for a comparison depends upon the number of means in the 
comparison and upon the coefficients of the means. For comparisons of two 
means the standard error of the comparison is equal to 4/2 sz, where sz is the 
Standard error of a mean. A comparison of the form z, -+ 2; 
standard error equal to 25;; etc. The follo 
the various aspects of applying Scheffé’ 
same nature as that committed using Tukey's test based on allowances. 
Ezample 11-6. The data of exam 


a i ple IT-1 are used to illustrate the application of 
Scheffe's multiple F test. The Standard error of the difference between two means is 
equal to 4/2 5: = L414(4.86) = 6.87. The value of Fos for v— 1 = 7 degrees of 
freedom in the numerator and f = 40 degrees of freedom in the denominator is equal 
to 225. The multiplier S is computed as follows: 4/(8 — 1)(2.25) = 3.969 = S. The 
quantity Ssz; ;, = 3.969(6.87) = 27.3 is computed; if the difference between any two 
nee ee Eid bas the two means are said to differ significantly. In the present 
, Wi 
S юш кер two means (table IT-2) are less th: 


— Ўз — d4 has а 
wing example will serve to illustrate 
8 test. The error committed is of the 


y an 27.3. 
: mparisons of only two means, co i i i 
means may be considered. For example, the experimenter Hema doo 


might wish to compare the 
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means for fat numbers 7 and 8 with other pairs of means. The standard error for the 
difference 24 + 2; — is — #7 = 44 is equal to /141.6{§ HE +ê + у = 9.72, and 
Sz4435-z5-2, = 3.969(9.72) = 38.6. The above difference exceeds the computed value 
and is declared significant. 
Another contrast of possible interest is the mean of the lowest three means and 
the mean of the highest five means; ie, (Z; + 28 + 25)/3 vs (41 + 2 + 224-23 
+ &)/5, or (185 + 182 + 178 + 176 + 172)/5 — (161 + 162 + 165)/3 = 178.6 


— 162.7 = 15.9. The standard error for this contrast is UIS + | = 3.55, and 


the computed significant difference is equal to 3.969(3.55) — 14.1. The observed 
difference, 15.9, exceeds this value, and the means of the two groups are said to differ 
significantly. Other multiple comparisons may be made if desired. The procedure is 
first to determine the standard error and then to multiply the standard error by the 
value of S = 3.969. The observed difference is compared with the calculated value in 
order to determine its significance. Scheffé's test is completed with a statement about 
the significance of the various comparisons. 

11-1.2.4 Comments on multiple F tests. An F test comparing the 
mean square of the means in a subset with the error mean square may be cal- 
culated after any multiple range test. The combined use of range tests and 
variance ratio tests results in a multiple F test. As stated previously the 
properties of most of these tests are unknown. 

Three multiple F tests are presented in this section. The first one involving 
an F test and subsequently an lsd test has undesirable properties when the 
number of means in the group becomes sizable [97]. Of the other two multi- 
ple F tests, Duncan’s multiple comparisons test has a larger "type I error" 
but a much smaller “type II error" than Scheffé's procedure. 

Any known or implied grouping of the treatments should be taken into 
account prior to using a multiple range or multiple F test procedure. After 
doing this, comparisons between two means will tend to dominate the kind of 
comparisons made. Since this is so, one of the multiple range tests or Duncan's 
multiple comparisons test should be used in preference to Scheffé's test. The 
size of the calculated intervals of two means is 27.3 for Scheffé's test and 22.0 
for Tukey's test based on allowances for the example. In order to be declared 
significant under Scheffé's procedure, a difference must be 3.9694/2/4.52—1— 
24 per cent.larger than for Tukey's test based on allowances when n — 8, 
f = 40, and а = .05. If comparisons of only two means are being made 
Scheffé's procedure would be inefficient relative to Tukey's test based on 
allowances. For comparisons involving several means, Scheffé's procedure is 
more efficient than Tukey's test based on allowances [296]. 

Duncan [96, 97] sets the type I error rate at œ, for rejecting the null 
hypothesis that ш = us = *** = n» When it is in fact true. The error rate 
ap must be computed for each value of p. For р = 2, o; is equal to a. In 
order to compute the least significant range for n means, Duncan uses the 


AC ise шешш 
relation 4/ 2(n — 1)Fa,(n — 1,fdf) ss = R'nsz; Scheffé uses the relation 
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(n — \)уЁ (п — 1, fdf) ss for comparisons between two means. Duncan 
sets his error at a, = a», and Scheffé sets his error at a. This is the only 
difference in the two formulae for the comparison of two means. Duncan’s 
significant range changes when p, the number of treatments in the subset, 
changes. Thus, for n — 1 = p treatments, Duncan’s least significant range 
is equal to 4/2(n — 2)F., (п — 2, fdf) sz. With Scheffé's procedure, groups 
of means which are said to be not heterogeneous may have a large mean square 
relative to the error mean square.This disagreement is not found with Duncan 8 
procedure. 

As the F analogue of the Student-Newman-K euls multiple range test, one 
could compute the various values, S,., = V (п — DF. — 1, fdf), Sa 
= Va = Fn — 2, fdf), ++, Ss = V/2F.Q, fdf), and S: = VF, faf) 
= tasas: The particular S; used would depend upon the number of means in the 
comparison. Tukey [296] states that there is little to recommend such a test. 


її-13 TUKEY'S GAP, STRAGGLER, AND VARIANCE TEST 


Tukey [292] developed a test for detecting non-heterogeneous subsets of 


means within a set of ranked means. The test consists of first finding gaps in 


adjacent mean differences which are larger than the lsd = ta, jys? V2. and 
subdividing the set of ranked means into groups. The next step is to separate 
straggler means (extreme deviates) from the mean of the group. The last step 
is t6 compare the variation among means in the subgroups with the error 
variation using the F test. 

Although Tukey [296] states that this test is now obsolete, it is presented 
to illustrate the procedure and because it is useful in selecting non-heteroge- 
neous subgroups. The test represents a combination of multiple range and 
multiple F tests after locating gaps in the ranked means [97]. 

A multiple range test compares the extremes of a group, whereas the gap, 
straggler, and variance test compares an extreme mean with the mean of the 
group. Hence, no confusion should result if the two tests do not agree with each 


un or with the F test. Different hypotheses are being tested by the various 


For Tukey's test, as with the others, an independent estimate of s", the 
error variance for the set of treatments, is required. The number of replicates 
per treatment should be equal, and all treatments should be subject to the 
same error variance s?. If the errors vary from treatment to treatment, none 
of the tests holds, and a transformation of the data (see section 11-2) will be 
necessary to stabilize the treatment variances. 


The various steps in performi 3 k 
data of example II-1. performing the Tukey test are illustrated with the 


Example 11- i ‚ 
grouping Mini d Tel в gap, straggler, and variance test, any natural 
S t. Assuming then that a single natural (biologica!, 
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physical, or sociological) grouping is available, we proceed as follows for the data of 
example II-1. 

Step (I). Choose a level of significance = а. For this example, a = 5 per cent. 

Step (II). Calculate the difference which would have been significant if there were 
but two treatments. This difference is commonly known as the “least significant dif- 
ference” (lsd). The lsd is equal to szlosogjA/2 = 2.0214/2(141.6)/6 = 13.88, where 
fos is the two-sided 5 per cent value for ¢ with 40 degrees of freedom. 

Step (III). Arrange the means in order of magnitude and consider any adjacent 
difference larger than the Isd as a group boundary. 

The differences of adjacent means from highest to lowest (values given on the 
diagonal of table 11-2) are 3, 4, 2, 4, 7, 3, and 1. None of these differences is larger 
than the 54 = 13.88, and we conclude that there is but one group. 

Step (IV). In each group of three or more means, find the group mean Z,, the most 
divergent or straggling mean Za, and convert the ratio 2:21. 
normal deviates by finding 

1a — tel Sign 
829 (for n > three means іп a group) 


shit rael 


into approximate 


or 
|a = žel mh 


ERIE, (for three means in a group). 


T 
35+ Fine 
Separate off any straggling mean for which this is significant at the chosen two-sided 


la, odf- 
The mean of the group of eight means is 172.6. The most divergent mean is 185; 
therefore, 
— i 6 
185 — 1726 6...8 
1 1 = = 1.78, 
з} к 


rmal deviate, 1.96, at the 5 per cent level. Hence, by Tukey’s 
test, none of the means is separated from the group mean, 172.6, as stragglers. Upon 
applying the F test, a significant value of F is found, F = 503.9/141.6 = 3.56 > Fos 


(7 and 40df) = 2.25. With such examples as the above, means spread out rather than 


clumped, Tukey’s test does not separate straggler means from the group means even 


though the F test indicates that differences are present. — 

The additional steps in Tukey’s gap, straggler, and variance test are used when an 
extreme deviate in the group of n means is obtained. The mean of the n — 1 means 
omitting the extreme mean is obtained, and the procedure in step (IV) is applied to the 
group of n — 1 means. The process continues until no more extreme deviates are ob- 
tained. 

Step (V). To the subgroups of three or more extreme deviates, apply the procedure 


in step (IV). 


which is less than the по 
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Step (VI). Compare the variation among treatment means in the subgroups with 
the error variance. 


The additional steps in Tukey’s gap, straggler, and variance test are illustrated 
in the original paper [292]. 
П-1.4 SELECTION OF LARGEST n MEANS FROM A SET OF v MEANS 


In certain experiments, (e.g., variety yield trials) it is desirable to include 
the largest mean (or the n largest means) and all means not different from the 
largest mean in the group considered to be "superior" with respect to the other 
means in the experiment. If the group interval, вау (max. — Asz) tO Fmax., iS 
large, then the size of a type I error, the error committed by misclassifying 
the treatments when they are equal, is small, but the size of a type II error, 
the error committed by misclassifying the treatments when they are different, 
may be large. If the type II error is not specified and the type I error is set 
equal to a, say 5 per cent, one of the multiple range tests may be applied. 

For the particular case of partitioning a set of v means ranked in order, 
вау Tı, 22, +++, 2, = Fmax., into an inferior and a superior group, Paulson [237-9] 
has provided the mathematical formulae but has not evaluated the various - 
integrals in tabular form. The number of decisions made possible by parti- 
tioning v means into an inferior and a superior group is 2° — 1. For v = 3, 
seven decisions are possible; these are the first, seven listed in section II-1. 
Paulson suggests that the situation ш = из = 5 = дуу = pandy, = p + A 


be considered in computing type II errors. A type I error of size œ is computed 


as 


ia da range of v means — 15 
gt Батып error ofa mean ^ s; < à), ara 


where p(w < a) represents the probability of obtaining a range less than 


Asz. The type II error, say 8, 


of the kind suggested by Paulson [237] is 
computed as 


Zi £, а 
BOY PE о Morenhi 1,2, 9-1). (11-3) 
Tables for Ee < 3) have been prepared by May [208; table II-1] for 
v= 2,3, +++, 20 and for various degrees of freedom in Sz. 

Paulson’s [237] formulae, (11-2) and (11-3), may be extended to the case 
esired to select as few treatments as possible so as to obtain a 

reasonable certainty of including the n largest ones; thus: 
aul = panes below v — n + 1 th mean < a) 


3i (11-4) 


and 


=1— pl 12.2, 8 
B=1 В T Mor each i = 1, 2, ++, v — n). (II-5) 
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Аз before, values for formula (11-4) may be obtained from May's tables or 
tabulated from formula (3) in Pearson and Hartley's paper [244] for various 
numbers of individuals in the group and degrees of freedom associated with 
the standard error. 


II-2 Transformation of Data 
II-21 GENERAL COMMENTS 


The selection of a scale of measurement will depend upon 
(i) the nature of the data and 

(ii) the type of statistical procedures to be used. 

The above two conditions are not incompatible, since the scale of measure- 
ment may be purely arbitrary for certain data. For example, the nonmetric 
(e.g., small, medium, large) system would not create any real misgivings among 
experimenters. Transformations such as the logarithmic or square root may 
cause some objections. Such objections are, in some cases, merely personal 
prejudices, and if presently known statistical techniques are usable for log X 
or УХ but not for X measured in the ordinary units, say pounds, there 
should be little reason for not measuring the variable X on another scale. 

А transformation of X to some function, f( X), should be made considering 
the two conditions set out above. If the principles of (i) are not violated and 
if the purpose for which the transformation is made is realized, the new func- 
tion should not cause any confusion. Bartlett [15] lists the following require- 
ments for an ideal transformation: 


(i) The variance of the transformed variate should be unaffected by changes in 
the mean. 
(ii) The transformed variate should be normally distributed. 
(iii) The transformed scale should be one for which an arithmetic average from the 
sample is an efficient estimate of the true mean. 
(iv) The transformed scale should be one for which real effects are linear and 
additive. 


The above conditions are related to some extent, since conditions (i), (ii), and 
(iv) usually imply (iii). However, a transformation selected to satisfy (i) may 
not satisfy the remaining conditions, and transformations satisfying condition 
(ii) may not fulfill requirement (i). The nature of the data and the type of 
statistical analyses used govern the importance of the above requirements. 
The following sections deal with a selected set of transformations which 
are considered useful in experimental work. Other transformations are dis- 
cussed in the references cited at the end of the book. The theoretical con- 
siderations concerned in making transformations are more fully discussed by 
Curtiss [81], Deming [87], Bartlett [15], Bartlett and Kendall [16], and 


Eisenhart et al. [102]. 
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The assumptions underlying the analysis of variance and the usual tests 
of significance are discussed by Eisenhart [100] in Biometrics; in the same 
issue, Cochran [54] discusses the consequences when the assumptions under- 
lying the analysis of variance are not fulfilled. These two papers, plus the one 
by Bartlett [15], are recommended reading for those interested in the assump- 
tions underlying the analysis of variance and in methods of processing experi- 
mental data when the assumptions are not satisfied. 

For the general case, let the variate X; be transformed to some new 
variate, say f(X;), in such a way that f(X;) is normally and independently 
distributed with mean y and variance c?. For a sample of size n, this may be 
represented symbolically as 

les 5 
T. e з 2,0090 -0] n 
dF(Xi, Xs, +++, Xn) = | I ч Taf(X,). (II-6) 


ov 2r 


The function f(X;) necessary to normalize the original variate X; may be 
known for some variates X; and unknown for others. If the distribution of 
the X; is known, it is possible to find the correct f(X;) for normalizing the 
data [212]. 

It should be remembered that a function f(X;) may normalize the data for 
each of the » treatments, but that the treatment variances may not all be 
estimates of the same population variance as is required in the analysis of 
variance. It may be impossible to find a transformation which results in 
homogeneous error variances. Methods for combining data with heterogeneous 
error variances are described in various places [43, 56, 59, 111, 209, 332]. 

In situations for which the population variance is a function of the popu- 


lation mean, say о? = g(u), the approximate transformation! is found by 
evaluating the indefinite integral, 


ахд "i 
Veoh KX), (11-7) 


and using the transformation F(X) on the X variates [15]. Occasionally, it is 
known that the sample means and variances are related by some unspecified 
function. If enough sample means and the corresponding variances are avail- 
able it may be possible to evaluate the relationship and to find the transforma- 
tion from equation (II-7). The resulting transformation tends to make the 
Means and variances independent. 
II-2.2 SQUARE ROOT TRANSFORMATION 
Brie oe root transformation is useful for data following the Poisson 
on. Counts of events having a small probability of occurrence often 
"The appropriateness of t 


һе indi i TY 
of the standard deviation of X and да o үе оа 


§ 11-2.4] The Logarithmic Transformation 47 


follow thís distribution. In the Poisson the variance is equal to the mean. 
cg? = y. Hence, by equation (11-7), we find that | 


ах 
Sk = 2уХ, (1-8) 


which indicates that the square root transformation is the appropriate one. 
The constant may be ignored. If the variances are proportional to the means, 
the square root transformation stabilizes the variances. ў 


Bartlett [15] has shown that УХ + 1/2 for numbers between zero and 
10 is more nearly correct than 4/ X. Even for numbers up to 15, Curtiss [81] 
indicates that VX + 1/2 is somewhat better than УХ. As a further refine- 
ment, Kendall [179] has proposed 4/ X + 1/2 + k and has given the method 
for evaluating Ё. The additional refinement over УХ + 1/2 usually is of little 
importance in experimental work, but the choice should be made by: the 


experimenter [also, see 4, 131]. 
II-2.3 THE ARCSINE OR ANGULAR TRANSFORMATION 

For data having binomial variation, it will be found from formula (II-7) 
that the arcsine transformation, 


dX opm 
f Vian 2 sin VX, (11-9) 


for X measured in fractions, is effective in stabilizing the variances. Snedecor 
[273] provides a convenient table (table 16.8) for making the angular trans- 


ion [also, see 129]. Bartlett [15] suggests that z or = be substituted 
1 : d 
— me substituted for X equal to one 


formati 
for X equal to zero and 1 - gol 


[also, see 4, 102, 131]. 
JI-2.44 THE LOGARITHMIC TRANSFORMATION 

For certain types of data the standard deviation is proportional to the 
2. From equation (11-7), we find 


гах lio X. 
Js Vd g X. . (1-10) 
transformation is appropriate. This transformation 
effects into additive effects. 
— a) and not log X may be normally distributed. 
it must be evaluated from the data. The procedure 
set f(X;) equal to log (X; — a) in equation (11-6) 
timating р, о, and a by the method of maximum 
od of moments [75, p. 258]. 


mean; ie. c? = E 


Thus, the logarithmic 
converts multiplicative 

The quantity log (X 
Since o usually is unknown, 
for estimating а would be to 
and obtain the formulae for es 
likelihood [62] or by the meth 
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a + X; 
Transformations of the nature of log (X; — a), log (a — X;), log =) 


log x) and log es == 2) have not been used as extensively as they 
BIDS ahold be, and it is felt that their popularity will increase as the 
experimenter becomes more familiar with them. The values for ein the other 
transformations may be obtained in the same manner as outlined for log 
(X; — o). The latter transformation is appropriate for data having a lower 
bound on the X variate; e.g., number of nodes on a plant. Log (а — X;) would 
be suitable for data having an upper bound equal to о on the variation of the 


XE A К, 
X;. The transformation log (E 25 A) is the general form which includes 
g= ў, 
Fisher’s [127] z transformation for 


1 


z= 1 log. G EN, In this special case the parameter c? may be found when 
the size of sam; 


ple is known. This is also true for certain o 
(15, 81]. 


the correlation coefficient; ie., 


ther transformations 


A frequently occurring problem in ex 
variance for a se 


ysis of variance They present an illustrative ex 
ample, discuss the theoretical implications of the transformation, and indicate 
what happens when the individual vari 


П-2.5 OTHER TRANSFORMATIONS 


Bartlett [15] and others [19, 22-9 
tional transformations, Some of these 
by Bliss [23, 24] and the logi 


4, 126, 129] have discussed some addi- 


t transformation developed 
1.08 P/(l — p), (22, 129] for 
for ranked data 


H-3 Test for Homogeneit: 


y of Variances 
Tests of Significance of variances from normally distri 
1 Y distribu i 
available for testing the equality of “Чул Ж. 
(i) two independe es (Snedecor’s F Fisher’ 
Ч , 82, is" 

p and Fisher and Yates’ Variance ratio [129], i ieee у 
Gi) k independent variances, [14, 16, 43], 
(ii) two variances with unknown correlation [213, 248], 
ey Ё variances and associated Covariances [309], 
ап 


nt уУагїапс‹ 
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(v) the variances and covariances within each of several sets and the covariances 
between sets [298]. 


In addition to the above a test is available for testing the proportionality of 
k independent covariance matrices [110]. 

A rather simple procedure for detecting obvious heterogeneity among a set 
of k independently estimated variances is to compare the ranges and meaus of 
the samples. To illustrate the procedure, consider the data of example I-1. 
The ranges and means for the ten samples are 


лизо ЗЕ BARES 20863 2:112 RES ORO SATO 
Mean | 25.4 26.8 27.0 28.0 29.0 30.0 31.8 32.0 34.0 38.0 
Range | 27 18 23 22 22 25 30 23 26 18 
There appear to be no wide discrepancies among the ranges or any relationship 
with the means. For larger numbers, say larger than fifty, of ranges the ob- 
served frequency could be compared with the tabulated frequencies [244] by 
a chi-square test. 

Box [34] has shown that Bartlett's test for homogeneity of variances [14] 
is almost as sensitive for testing non-normality as for testing heterogeneity of 
variances. Thus, it is essential to have the original data normally distributed, 
or nearly so, before computing sample variances. 


II-4 Test for Additivity of Data 


Tukey [293] presents a method for isolating a single degree of freedom 
associated with nonadditivity in a two-way classification. The degree of 
freedom is isolated from the residual sum of squares and compared with the 
remainder mean square in order to test the hypothesis of additivity of data. 
Two well-known stituations wherein the sum of squares due to the one degree 


of freedom for nonadditivity is increased are 


(i) where one or more observations are unusually discrepant 


and 
(ii) where the row and column effects are not additive. 


The sum of squares associated with the one degree of freedom for non- 
additivity is 
[2 £X45.-26,-2] (IL-11) 
del j= 
X (8. — 2)? 5 @.; — £)? 
Li i 


y 


where X;; represents the individual observation in an r X c two-way classifica- 


tion, 2;. represents the row mean, T.j represents the column mean, and £ 


represents the mean of the rCXj 8. y 
The quantity obtained from equation (IT-11) is the linear row by linear 


50 Some Useful Statistical Tools and Concepts [$ П-4 


column interaction sum of squares with one degree of freedom [273, et "ia 
Although this type of interaction appears reasonable for a number о xime 
tions, there is no reason why the linear row by quadratic column ш Д s 
the quadratic row by linear column interaction, or the quadratic by quadra 


interaction could not account for nonadditivity. Also, the nonadditivity could 
arise from more than one source, 


appropriate transformation is d 

Although other tests for a 
Situations, some are available. 
sec. 26.2] and Kempthorne [175, sec. 8.3], D. S. R 


special tests. One of these is presented in example VIII-1. The second test 
involves the use of a standard or check treatment in an experiment. The 
analysis of variance for the completely randomized design 18 


Average value of mean square 


Source of variation df Additive model 


Multiplicative model 
Among treatments 


Yim 1 


Within treatments (у —1)(т — 1) о? n 2/(у —1 
(excluding check) à БУ =) 


Within check treat. 


where Y; is the 
in the experiment, b; i 


1Unpublished Tesults, Cornel] University. 
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treatment, 2,; is the value associated with the particular experimental unit 
when no treatment is applied, and и; is the mean of the ith treatment. If the 
b; are all equal to one, equation (11-12) becomes the additive model [175, sec. 
8.4]. Under the additive model the average values of the within treatment and 
of the within check variances are both equal to o°; under the multiplicative 
model the mean squares have the average values given in the above analysis 


of variance table. 


Il-5 Nonparametric Tests in the Analysis of Variance 


A great deal of importance has been placed on statistics developed for 
variates which are normally distributed. This has been justifiable since most 
distributions, or their transforms, are not often so abnormal but that the 
distribution of means is near enough normal for the usual tests of significance 
to apply to a sufficient degree of accuracy. When the appropriate transforma- 
tion is unknown, statistical methods independent of the parent distribution 
are required. Such methods are called nonparameiric or distribution-free 
methods and are usually based on order statistics; e.g., the median. Mood 
[212] presents a number of these techniques in Chapter 16 of his book. One 
of the techniques is suited for the analysis of a two-way classification with one 
or more observations per cell and with a test of the main effects against the 
interaction. In some cases, considerable time may be saved by using Mood's 
method instead of the ordinary analysis of variance procedure. 

Wilcoxon [306] has presented a nonparametric test for a two- or more-way 
classification involving the ranks of the treatments in one of the classifications. 
The computational procedure is simple and the assumption of normality is not 
required. Several numerical illustrations and the necessary tables have been 
supplied in Wilcoxon's instrüctive paper. 

In comparing paired observations, one may use a plus and minus sign test 
of the differences and compare the sample results with expectations from the 
binomial (1/2 + 1/2)" or with x? [90]. Fisher (126, sec. 21] presents a test for 
comparing the observed results with theoretical results without using the 
assumption of normality. Nonparametric methods for other situations are 


discussed in various places in the literature [e.g., 37, 83, 151]. 


2 
II-6 Probability Levels for t, F, and X 

tics known as chi-square, x°, Fisher's 2, H 
]l known to statisticians but not to all 
are set forth here in order to avoid 


The relationship among the statis 
Snedecor’s F, and Student's t are we 
users of statistics. These тшш 
confusion in later sections of the book. $ 

Fisher's z is equal to log. A/F, where F is Snedecor's F. If tables of per- 
centage points are available for 2, it is a relatively simple operation to obtain 


F tables and vice versa. Likewise, tables of the percentage points of the correla- 


* 
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tion coefficient may be computed from tables of percentage points for F (or 

2), since R 
Fa —1,k — пар П Rk = ny 

where F has n — 1 degrees of freedom in the numerator and (k — n) in the 

denominator, R? is the correlation coefficient between the dependent variate 

and the n — 1 independent variates, and k represents the sample size. The 

zero order or total correlation coefficient r is obtained when n = 2. И 

Merrington and Thompson [210] have tabulated several percentage points 
(.50, .25, .10, .05, .025, .01, and .005) for the F distribution with various 
numbers of degrees of freedom associated with the mean squares in the num- 
erator and in the denominator. Fisher and Yates [129] have tabulated the 
20, .10, .05, .01, and .001 percentage points for F. The percentage points for 
the F distribution for .10, .05, .025, and .01 are presented in table II-8. For 
more extensive tables of F the reader is referred to Snedecor [273, table 10.7], 
to Fisher and Yates [129, table V], and to Merrington and Thompson [210]. 

Student's ¢ with n; degrees of freedom is equal to the square root of 
FQ, nedf), or t(nidf) = FQ, nedf). Thus, if F tables are available, £ tables 
may be computed by obtaining the square root of the Ё values for one degree 
of freedom in the numerator and n; degrees of freedom in the denominator. 
The square root of the F values in the second column from the left-hand-side 
of table IT-8 yields a £ table. The number of degrees of freedom associated with 
t are those in the first column on the left-hand side. The two-sided 1 percent- 
age points correspond to the percentage points for the F distribution. A graph 
of t values is presented in figure 11-4. 

In addition to graphing ¢ values and 
pared a graph of x? values and percentage points for various degrees of freedom 
(figure IT-4). The equality of 
sented on the same graph. А]: 
points may be obtained from the Р table by m 
T; = œ by the degrees of fre 
a x? table could be prepare 


2 


since the quantity +/ 2x°(f df) — Vf — lisa 


d variance unity 
greater than 1.64, the observed x? 


Significant at the 5 per cent level, 
For percentage points of F, t, and xX? not in tables 
uu Е approximating these values. Bancroft [5 6] 
tables that have been prepared and i i 
NS illustrates the pro 


pproximately normally distrib- 
[127]. If 4/25 df) — A/2f — 1 is 
value with f degrees of freedom is adjudged 


formulae are available 
lists the references to 
cedure for calculating 
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TABLE 11-8. Percentage points for the F distribution* 


10 per cent points 


5 per cent points 


2.5. per cent points 


TABLE 11-8 (continued). Percentage points for the F distribution* 


Ons 
© © 0 


1 per cent points 


C. M., “Tables 
tables for slatisticians, 


pson, 


paper by Merrington, M. and Thom 


*This table was reproduced with the permission of the Biometrika Trustees from the 


of percentage points of the inverted beta (F) distribution,” Biomeirika 33:73 


-88, 1943, and from table 18 of Biometrika 


Vol. 1, Cambridge University Press, 1954, by E. S. Pearson and H. O. Hartley 


CHAPTER III 


Plot or Pen Technique 


The subject of “plot technique” or “pen technique” is defined herein as 
the topic dealing with the effect of individual items upon each other and of 
experimental units! upon each other (competition), the size and shape of the 
experimental unit, the number of experimental units per treatment, and 
methods of sampling the experimental unit. Although the term “реп tech- 
nique” is not commonly used, it could be used to describe technique for 
animal experiments in much the same way that the term “plot technique” 
is used for plant experiments. The problems may be different, but the 
principles are usually related if not identical. Similarly, the term “laboratory 


technique” could be applied to procedural methods applicable to laboratory 
experiments. 


АП items described above, as well 
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required number of replicates for a Spec 
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III-1 Competition 


Competition is defined as the effect of one individual or set of individuals 
upon another individual or set of individuals. The term as used in this text 
has a broad meaning. Similarly, the terms "treatment" and "yield" often 
have broad definitions in experimental design literature. In example 1-1, 
"competition" was introduced into the experiment with the "balanced" ar- 
rangement that required that all sample means be equal; this could be called 
induced compelilion; the variation among individuals is larger than it should be; 
i.e., an additional component of variance has been added to that due to the 
random variation among individuals. 

"There are two kinds of competition of importance in designing experiments 
and in testing the significance of the results; these are intra-experimental unit 
competition and inter-experimental unit competition. 'The former term refers 
to the effect of individuals upon each other within the experimental unit, while 
the latter refers to the effect of experimental units upon each other. 


Ш-1.1 INTRA-EXPERIMENTAL UNIT COMPETITION 


Intra-experimental unit competition exists when individuals within the unit 
affect each other either advantageously or detrimentally. Examples of this type 
of competition abound in experimental work; for example, plants in a field plot 
are in competition for light, water, and nutrients; animals within a pen or cage 
are in competition for food and water; workers in the laboratory may have more 
results alike when working together than when working alone; some animals in 
a pen are more pugnacious than others, often causing some physical injury to 
other animals, etc. If the variance among experimental units is utilized to test 
treatment differences, the intra-plot competition has no effect upon the experi- 
mental technique as long as the sum of the competition effects within the unit 
is zero; this requires that one individual gains while another loses. If the varia- 
tion among individuals or sampling units within the experimental units is 
utilized in tests of significance, the error variance may be wrongly estimated 
because of competition. 

More space or more time between individuals or units may be utilized to 
overcome intra-experimental unit competition. In plant work, this may require 
more space between plants. In laboratory work the technicians could work on 
different samples, be placed in individual booths, or be given the proper training 
to eliminate "competition." It is often desired to estimate the environmental 
and the genetic components of variance among individuals within an experi- 
mental unit, If intra-experimental unit competition is present, it may be im- 
possible to estimate the genetic component of variance. The objectives of the 
experiment and the choice of the experimental error variance will determine 
whether or not intra-plot competition is to be ignored. 

Suppose that there are sampling units or individuals per experimental 
unit amd that k of the h units are drawn at random. The variance of the mean 
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It has been found that sore crop varieties adversely affect other varieties grown 
adjacent to them (e.g. tall and short varieties) [193, 196]. Some varieties 
apparently are better “competitors” than others. Plants on a plot with no 
fertilizer treatment or with no irrigation treatment may draw some of their 
nutrients or water from the treated plots. Animals in the end pens or cages of 
individual feeding trials may not fare as well, since they lack "company" [98]. 
Likewise, laboratory technicians may not perform as efficiently if completely 
isolated from other technicians. 

The simple expedient of providing more or less separation (either through 
time or space) of experimental units is usually sufficient to remove the effect of 
competition. If competition effects are likely, the experimenter might widen the 
distance between rows and decrease the distance between plants within the 
rows. In this way the same number of plants per acre is maintained. The intra- 
row competition would be severe, but there would be negligible inter-row com- 
petition. If'the row is the experimental unit and is harvested completely, the 
competition component of variance may be essentially eliminated from the 
experimental error variance. Also, such a procedure tends to correct for slight 
variations in stand. Competition may be controlled in the manner suggested 
above or by border or guard material which is discarded at harvest. However, 
the ratio of guard or border material to experimental material should always be 
considered relative to alternatives. The cost of maintaining the guard material 
relative to the cost of maintaining the experimental unit is considered in sec- 
tion I1I-2.2.2. 


III-2. Size of Experimental Unit 
III.21 FACTORS AFFECTING THE SIZE OF THE EXPERIMENTAL UNIT 


The optimum or recommended size for an experimental unit cannot be given 
without first considering a number of factors. Ап attempt is made to include all 
factors, but undoubtedly some are omitted in the following list: 

(i) Practical considerations: Certain practical aspects may dictate the size of 
an experimental unit. In animal experiments the pens or cages may be already 
constructed and not easily changed. The pasture or paddock size may be deter- 
mined by those already available and fenced. The size of a field plot may be 
determined by the facilities available for handling plots. If grain combines and 
other power equipment are used, a fairly large plot may be essential. On the 
other hand, power equipment may be unavailable and the experimenter is 
forced either to use small plots or to subsample. In other experiments, e.g., 
ilities may dictate the use of small experimental 


greenhouse studies, the faci 1 У : r 
units. The total amount of experimental material available is a factor in deter- 
mining the optimum size of a unit. The experimenter may have only a fixed 
number of animals, greenhouse pots ог flats, or of plants or seed, or a fixed area 


of land available to him. Ҥ the size of the experimental unit and of the experi- 
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mental area is fixed, the number of experimental units is automatically deter- 
mined. 

(ii) Nature of experimental material: The type of experimental material is 
associated with the practical considerations discussed above, but some addi- 
tional features are worth noting. The particular material under study will 
determine plot or pen size to a large extent. Certain types of variability and 
cultural operations are associated with the kind of crop or animal. The opti- 
mum plot size for a variety yield trial for oats is different from that for corn. 
Likewise, the pen or cage size for chickens is not the same as for beef cattle. 

(iii) Number of treatments per block or per incomplete block: The number 
of treatments per block or per incomplete block may be kept within rather 
narrow limits simply by the choice of a design. If large numbers of treatments 
are to be tested, an incomplete block design (Chapters IX-XIII) may be used. 
For a given design, say the randomized complete block (Chapter V), and a 
variable number of treatments, smaller'plots may be selected for the larger 
number of treatments. 

(iv) Variability among individuals or units within the experimental unit 
relative to the variability among experimental units treated alike: The var- 
iance of a treatment mean, sz, is proportional (equal in most cases) to 

yy pi 
= PD (III-4) 
where V,’, V,’, and kare defined in equation (III-1) and r represents the number 
of experimental units per treatment. The relative size of the two components 
of variance V,’ and У, has considerable effect upon the optimum experimental 
unit size. 

(v) Cost per individual or unit relative to cost of experimental units: Let 
С, be the cost of an individual item within the experimental unit which is inde- 
pendent of the cost of the experimental units, and let C, be the cost of the exper- 
imental unit independent of the number of individuals in the unit. Then, the 
Cost per treatment is 


rkC, + rC, = Ci. (П1-5) 


The total cost for an experiment involving v treatments is 2С, = C. The opti- 


Inum size for an experimental unit will depend upon the ratio of the costs. 
1-2, 
UNI METHODS OF DETERMINING THE SIZE OF THE EXPERIMENTAL 
Several ate Т 
Е ыа for determining optimum size and shape of plot have been 
but. only. two Men time [e.g., see references listed in 113, 193, 196, 258], 
Smith's Vari ese methods, "maximum curvature" and “Fairfield 
аПапсе Law,” are discussed here. 
Ш-2.21 Maxi 3 
b aximum 
method, so nameq because ҮЛЕ He method. The maximum curvature 
manner in which the optimum size of plot is 
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determined, has frequently been used to determine optimum plot size for various 
field crops. A field of the crop is harvested in small units of size x. The units are 
combined into various sizes. The sums of squares among the various plots for a 
given size are computed and divided by the degrees of freedom to obtain the 
variances. The various plot size means are also computed. From these results 
the coefficients of variation are computed. The coefficients of variation are 
plotted against the respective plot sizes. A freehand curve is drawn through the 
resulting coordinates, and the point of maximum curvature is located by inspec- 
tion, The optimum plot size is the one just beyond the point of maximum 
curvature. 

The method has two weaknesses: (i) the relative costs of various plot sizes 
are not eonsidered and (ii) the point of maximum curvature is not independent 
of the smallest unit selected or of the scale of measurement used. For certain 
types of material, e.g., where the smallest unit within the experimental unit is a 
hill of corn, an animal, a single determination or reading, etc., the choice of the 
unit is fixed. The maximum curvature method is useful for this type of material 
if the scale of measurement is fixed. It is not useful for material with an arbi- 
trary unit of observation (e.g., a fixed length of a row of a crop) or an arbitrary 
scale of measurement. 


Ш-2.2.2 Fairfield Smith's Variance Law. An empirical relationship 
between plot size and plot variance was developed by Smith [267]. Mahalanobis 
[204] developed essentially the same equation in his studies on sample surveys 
in India. The variance law is expressed by the equation, 


log V. = log Vi — b’ log q, (III-6) 


where У, is the variance of yield per unit area among plots (experimental units) 
of size z units, V; is the variance among plots of size unity, and b’, the regression 
coefficient, indicates the relationship between adjacent individuals or units. The 
limiting values of the regression coefficient are zero and one unless inter- 
experimental unit competition is present. If the experimental unit is composed of 
a random selection of z individuals, b’ = 1, and if the x individuals are identical, 
b’ = 0. In field work, there is likely to be a correlation between adjacent plots, 
resulting in values of b’ less than unity. Smith [267] computed b”s for thirty- 


. eight different sets of uniformity trial data and found that most of the regression 


coefficients fell within the range 0.2 to 0.8. 

If shape is ignored and if a block of v treatments is selected, the following 
equation defines the relationship between the constants b and (Vi), from a 
field with an infinite number of blocks and the estimates b‘and V; obtained from 
the data (see formulae (III-13) to (III-16)) : 

(= 27900). 
(V2; = ^q —l/9s * (1-7) 


where (V.), is the variance per unit of size z for а block of v treatments, b is 
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the value of b’ adjusted for field size, and (V1), is the variance for an infinite 
field with experimental units of size unity. The adjustments for b' may be ob- 
tained from a table prepared by Smith [267]. The cost per experimental unit 
as defined by equation (111-5) is 


aC, + C; = Cir. (111-8) 


From these relationships the optimum size of plot or experimental unit is 
found to be [267; see section II1-2.2.3]: 


= bC,/(1 — b)C.. (III-9) 


The formula for estimating the most efficient plot size for any given experi- 
ment, i.e., for a given number of treatments and a particular experimental 
design, is unaffected by the number of plots. The number of plots or experi- 
mental units per treatment, r, depends upon the area or material available 
or the degree of precision desired. 
If guard area is required around the experimental material and if the cost 
equation is of the form, 
C, + zC, + C,(A + Ba), (11-10) 


where C, is the cost of maintaining the guard or border material, В is the ratio 
of guard material to test material, and A is any additional cost, Smith [267] 
has shown that the optimum plot size is given by the equation, 


_ (C, + GA) 
*7 (0 - 0), + GB) (rin 


The value of b in the range .3 to .7 does not greatly affect the increase 
in cost or in variance when plots of size one-fourth to four times the optimum 
plot size are used. On the basis of these results, plot sizes of one-half to twice 
the optimum size result in small losses in efficiency. However, for plot sizes of 
one-fourth or four times the optimum size a loss in efficiency of about 20 per 
cent results because of the increased variance. 

Robinson ef al. [258] and Wassom and Kalton [304] have applied Smith's 
results to uniformity trial data on peanuts and on brome grass, respectively. 
These authors used percentage costs rather than actual costs. The use of per- 


centage costs may be misleading if the size of the smallest unit is not taken 
Into consideration. 


In animal experimentati 1 1 1 
ation the value of b for a particular trait sometimes 
may be obtained from p i 


Е genetic theory if knowledge of the relationships among 

ES i ; t g ships a 

of не experiment is available. In such cases the b values instead 

A ated b' values would be available and would not require adjust- 

T the size of the group of experimental animals. 
III-2.2.3 А 

Smith's Vario ne mathematical developments related to “Fairfield 
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mathematical results obtained by Smith [267]. These are not included in the 
previous section because they are not essential to users of the methods but 
may be of interest to students of statistics. 

In equation (П1-6), Б' is the regression of log s.? on log z;. Since the 
$22, the individual sample variances on a per unit basis, usually have variable 
numbers of degrees of freedom, and therefore different variances, a weighted 
regression is preferred. The weighted regression coefficient, b’, is obtained 
from the formula, 

Cw; log 5.2 log 2: — Nw: log sz? Dwi log zi/ Уш: (III-12) 

Xiwilog z)? — (ow; log т;)°/} ш; Н 
where ш; = the degrees of freedom associated with a given variance. The 
variance of log sz? to a first approximation! is 2/w;. The weights are pro- 


portional to the reciprocal of the variance. 
Equation (III-6) may be written in the form, 


log (V3. = log (Vi)n — b' log z, (III-13) 


where » — n/z — the number of plots per block, n — the size of the field, 
(V), denotes that V. is measured over v plots of size z, and z — the size of 
the plot. For an infinitely large field the law is 


log (Vj), = log (И). — blog т. (III-14) 


riance between blocks may be esti- 


Since the block is a large plot, the уа [ 
table for г blocks of size v 


mated in the same way. The analysis of variance 
may be represented as follows: 


Source of variation af ms, 
Among blocks r—1 yon 
Within blocks r(v —1) T 
Total rv — 1 (V3rv 


If r tends to infinity and if (11-14) is used to relate (Уш), and (Vz)r» we 


obtain 
H arae (r e 1)(У.:), = = v (Vi) ' 
ва (юр, = = DU == ata) - (FEA) C9) 
(01-15) 


For a given number of plots per block, v, and a given value of b the ratio 


(V2) /(V.),, is constant. 
i i in a Taylor series expansion. ` 
"Th i 'z;? is а roximated by the first term in a Тау e 
A Sue ae a pu is obtained from using additional terms in the Taylor 


Series expansion. Since the 5, are not independent and since b’ varies from year to year on 
the same field, even а graphical estimate of b may be sufficiently accurate. 
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Equation (III-13) is now rewritten as 
"T b 
log (У). = log (V), — blog z + log H, (Шаб 


where v = n/z is now variable indicating how V. as usually evaluated from 
uniformity trial data is measured over variable block sizes in terms of plot 
numbers. Since the value of b is affected by the size of field and since equation 
(11-16) exhibits a tendency for curvature for values of z/n > 0.1, Smith 
[267] provides corrections for converting the estimated regression coefficients 
into equivalent regression coefficients. For .01 > т/п > .001 the corrections 
for b' > .5 are negligible but become larger for b’ < .5. When .01 < z/n < .1 
the corrections become important for b’ < .7. 

In determining the optimum plot size, we wish to maximize the amount of 
information per unit cost. The amount of information is defined to be the 
reciprocal of the variance. We could also minimize the relative cost per unit 
of information. Using the variance obtained in equation (III-15) and the cost 
function given in equation (111-8), the cost per unit of information is 


(zC, + C,) _ (zC, + C,)(1 — 2-9) (И)., 
С doms = ОШ) 


Tf we take the derivative of C, with respect to z and set the result equal to 
zero, we obtain 


D ARA | (С, + 20) + z'G|-0o. qs) 


Solution of (111-18) for z results in equation (III-9). Likewise, if we use the 
cost relation described by equation (111-10) and proceed as above, the solution 
for z as given by (III-11) results. 


If the optimum plot size is not utilized, it is desirable to know what losses 
in efficiency result [267]. The cost per unit of information, equation (III-17) 
is a minimum when ж = 5C,/(1 — b)C, = optimum plot size. When another 
plot size, a, is used, the relative cost per unit of information is 


Gracia” O 1 - E (8) a - oi^ 


"AT p)g bass leo ү. po - Ines n (111-19) 


where In is the natural logarithm. Smith found that the last part of equation 
(11-19) 1s more convenient computationally. 


Ш-2.2.4 Some additional results. If the material making up the 
experimental unit is composed of discrete units (e.g., animals in a pen or 
plants ina plot), if the number of units, k, included does not alter V,’, and if 
the variance among experimental units irrespecfive of number of sampling 


Np 
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units is V,’, the variance of a mean of r experimental units is 
, 
ys H Vy + Wie (1-20) 


The optimum number of units, Ё, per experimental unit may be obtained 
with the procedure used to obtain equation (III-9). This is possible if the cost 
function is of the form rf(k), where f(k) is some function of k (e.g., equation 
(ITI-5)). If the cost function is not of this form, the cost per unit of informa- 
tion is not independent of the size of the experiment, and, hence, is not an 
appropriate criterion for fixing the size of the experiment. 

The cost function usually considered is equation (IIT-5), where the cost 
per additional experimental unit is С, + kC.. In this case, maximization of the 
amount of information per unit of cost (or minimization of the reciprocal), 
ay, = M. + BV yy + RC) (111-21) 

ere 
leads to the solution of the optimum number of sampling units (animals [199] 
or plants) per experimental unit, thus: 


k = /C,V.'/C.V;'. (П1-22) 
In connection with the above, we could minimize cost for a fixed V; ог we 
could minimize V, for a fixed C,. Both procedures would lead to the same 
answer for г and k. If C, is rf(k), then r is fixed, and there will be a unique value 
of k which maximizes (111-21). 
Another procedure is to use Lagrange multipliers to add the cost condition, 
differentiate, set equal to zero, and solve for the optimum plot size. For this 
case, we proceed as follows: 


к(2= л i МС, — rC, — тС) = 6. (111-23) 
90 У, » : 
= Еке ArC, = 0. (111-24) 
а eio e X d 
7 ER dr X») + МС, + kG.) = 0. (П1-25) 
90 

A = б‹— 76, — т, = 0. (III-26) 


Solving for & from equations (III-24 to 26) results in the solution given in 
equation (11-22). 

Koch and Rigney [188] suggested that use be made of the mean squares 
from experiments with three or more sources of variation to determine 
optimum plot size. For example, the analysis of variance for a "split plot 
design” (Chapter X) contains three sources of variation that could be used, 
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i.e., the replicate or block mean square, the error (a) mean square, and the error 
(Ы) mean square. Likewise, a one-restrictional lattice design (Chapter XI) 
contains three mean squares: replicate or complete block, incomplete block 
(eliminating treatment effect), and intra-incomplete block; these may be 
utilized in a manner similar to that for the split plot design. Before computing 
the regression coefficient for Fairfield Smith's Variance Law, it is necessary 
to reconstruct the various mean squares in order to make them comparable 
to calculations for uniformity trial data [188]. 

In making use of the method of Koch and Rigney, two considerations are 
of importance. The first has to do with the composition of the experimental 
error mean square; if this mean square contains a replicate X treatment 
interaction component of variance in addition to a component of variance due 
to plots treated alike, the experimental error will be larger than a comparable 
mean square obtained from uniformity trial data. A replicate X variety inter- 
action would invalidate this method for obtaining information on optimum 
plot size. The second consideration has to do with the selection of the replicate 
or blocks layout. Ordinarily an attempt is made to remove as much variation 
as possible with blocks. This is not the case for uniformity trial data. Thus, 
the optimum size would not be independent of the experimenter's ability to 
stratify the experimental material. 


III-3 Shape of Experimental Unit 


Theoretical considerations of the shape of an experimental unit may be 
found in the writings of Christidis [39, 40], Cochran [47], Ghosh [133], Jessen 
[166], and Taylor [286]. Jessen considered the shape as well as the size of the 
sampling unit in sample surveys; he obtained an empirical relation similar to 
Smith's [267]. 

For field work the results by Christidis [39, 40] and by Taylor [286] 
appear satisfactory. Taylor [286] presents a good literature review of the work 
on shape except that he failed to include the results by Christidis in 1939 [40]. 
As far as is known, shape of pen, paddock, or cage has not been studied. There- 
fore, the results discussed in this section apply only to field experiments. 

For small sizes of plots the shape may have little or no effect, whereas for 
large plot sizes the effect of shape may be considerable. The optimum size and 

shape of plot will be the one resulting in the smallest variation among plots 
within a block. i 

Given a plot size and number of treatments, Cochran [47] has graphically 
presented the case for choosing the optimum plot shape (figure III-1). He 
considers that we have v = 9 treatments to compare and we wish to select the 

plot shape with the smallest average experimental error variance when the 
direction of the fertility gradient is unknown. The two extreme shapes selected 
are rectangular (plan (a)) and square (plan (b)) as shown in figure ITI-1. If 
we consider the case for which a linear gradient exists, we may picture the 
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EM as shown. Suppose that the fertility gradient is parallel to AB or that 
eds e lie perpendicular to the gradient. The sum of squares among the 
s р ots would be 8c? + 600°. If the fertility gradient were parallel to AC, 
Eo ap age be Lee to the gradient and all plots would be equally 
a y the gradient. In this case th g i 

Metroid DEOR e sum of squares among the nine 


A 
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Plan (а) 


If c is the random variance within blocks independent of the shape of 


plot, the mean squares for the two arrangements are 
Average value of mean square 


Fertility gradient af plan (а) plan ()) 
Parallel to AC 8 e Ра ge 
Parallel to AB 8 с? + Se с? + Sgt 

8 в? + ae с + Hg 


Average 
plot shapes, rectangular and square 


Figure III-1. Mean es as affected by two 
[from Cochran, 41]. Ex 


If the plots were square, а8 in plan (b), the sum of squares among plots 

would be the same for both cases. The average mean square for thelong narrow 

Plots is smaller than for square plots. The actual situation is probably some- 

mus in between the average values given in figure пт, but the relationship 
etween the two plot shapes still holds. 

If the value of g is small, the selected plot shape is largely a matter of 
Preference, For g large, long narrow plots should be selected, since the addi- 
tional variance due to plots with other than optimum shape will need to be 
Compensated for by additional replicates. It may prove impractical to use 
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long narrow plots because of the nature of the crop, and it may be less costly 
to increase plot size or the number of replicates to obtain the desired degree 
of precision. ғ 

The above discussion has assumed that the size of plot is given. If it is 
desired to obtain the optimum size and shape simultaneously, the reader is 
referred to the papers by Jessen [166] and Taylor [286]. 


11-4 Replication 

Since variability is almost universal, replication (the repetition of the set 
of treatments in the experiment) is, or should be, practiced in nearly all experi- 
mental work. Fisher [126; Chapter I] states that the two conditions necessary 
to obtain a valid estimate of the experimental error are replication and ran- 
domization. Hence, replication is an important feature of experimental work. 
The need for replication was recognized by some workers as early as 1846 
[193, 196]. 

Despite the fact that replication is used in experimental work, its meaning 
is not always clearly understood. If a treatment is applied to absolutely 
homogeneous material, there is no variation from experimental unit to experi- 
mental unit; in effect, only one replicate from the population of possible 
replicates is obtained even though several observations are made. Several 
readings on a single plot (or pen) do not result in replication on a treatment 
which is subject to environmental variation but merely result in decreasing 
the variance due to measuring or technique errors; the plot to plot or pen to pen 
variation has not been reduced. A single greenhouse, nutrient tank, or oven 
constitutes one replicate from a population of greenhouses, nutrient tanks, or 
ovens. In perennial pasture experiments, years do not represent true replica- 
tions for perennial pastures grown on the same plot, whereas in annual pastures 
which are reseeded and reallocated every year the years constitute replication 
[46, 199]. 

{ A suitable criterion regarding the desired nature and extent of replication 
in experimental work might be formulated in the following terms. In the popu- 
lation to which we wish to make inferences concerning treatment differences, 
there will be a number of sources of variation affecting treatment differences. 
Some sources of variation affect the variability of treatment differences more 
than others. The variance of a treatment difference is reduced more rapidly by 
dncreasing the number of observations of treatment differences over the sources 
то asper y aci gie of variance in the error variance. de 
by incessus r. The ЕНУ REL, may be reduced by Angredsine de 

0 - value reduces both sources of variation, while 10- 
creasing k does not affect the quantity У„//г. 


П1-41 SHAPE OF THE REPLICATE 


Replicates or blocks should be set up to control as much of the variation 
as possible, resulting in the smallest experimental error variance. Not only 
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location but also the shape of the block is of importance in field trials. In the 
absence of knowledge concerning gradients the average sum of squares among 
blocks will be largest for square blocks (see figure 111-1); i.e., it is desired to 
have just the opposite effect for block shape as for plot shape. If any knowledge 
about variation is available, it should be utilized in setting up the blocks. If 
experiments are conducted on contoured land, it may be inadvisable to have 
square blocks if а block extends over more than one contour. À long narrow 
block may be the one containing the smallest within blocks variation. 

The above consideration is directly applicable to pasture experiments. Itis 
not known how shape of replicate affects the variation among cages or pens in 
other experiments on animals. 


Ш-4.2 SIZE OF REPLICATE 

The replicate size for a selected design is determined by the number of 
entries or treaiments and the plot or pen size. The greater the number of 
entries the larger the replicate size unless the plot or pen size is reduced to 
compensate for the increased number of entries. The upper limit on replicate 
size will depend upon the nature of the material and treatments being tested 
as well as the character being measure 


increases with the number of entries per replica 
to have too many entries per block. Of course, the additional variation may 
be compensated for by increased replication, but this is costly. Another way 
of dealing with large numbers of entries is to use incomplete block designs 
which allow for removal of incomplete block variation within replicates. A 
study of the particular material under experimentation will allow the experi- 
ae to formulate criteria for determining replicate size [199, 234, 258, 277, 
32]. 


d. Since the experimental error usually 
te or block, it is desirable not 


III-43. NUMBER OF REPLICATIONS 


The number of replications for an experiment will be determined by 


(i) degree of precision desired, ^ ial 
(ii) amount of variability present in the experimental material, 
(iii) available resources, including personnel and equipment, and 
(iv) size and shape of experimental unit. 


ts 

The degree of precision desired depends upon the panim ү рг үү: 
and the characters observed as well as the expected ius u a 
ment differences for a specified character. If differences are large, 


9f precision may be accepted. 
TM € degree of precision is de 
LOO SE mean (the variance 0 
"n ment difference is of importance, +7, 
tp ming the risk he is willing to take for @ es 
* stated size does not exist when it is actually pr 


lity associated with the 
Given that a specified 
i zes a decision con- 
experimenter makes a de 
Е ЗР егіле that а true difference of 
ent (type IE error) and 


fined to be the variabi 
f a treatment mean). 
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(ii) asserting that a difference larger than the stated value exists when no true 
difference exists (type I error). The degree of precision desired may be ех- 
pressed in per cent of the mean; i.e., it may be desired to have a standard 
error of a mean which is 3 per cent of the treatment mean. 

In connection with determining the number of replicates for an experiment, 
the experimenter should list the characters of interest with their estimated 
standard deviations. The number of replicates is determined for the character 
of interest. Tf several characters are of importance, the number of replicates is 
determined for the most variable character. This number is larger than the 
required number of replicates for all other characters. If the size of treatment 
differences of interest varies and if different levels of confidence are desired 
for the various characters, it may be necessary to compute the required num- 
ber of replicates for all characters individually and to choose a number large 
enough for the specified conditions. 

The degree of variability present in the experimental material depends 
upon the treatment tested and the character measured. Some characters have 
a low degree of variability relative to other characters. For example, some 
chemical measurements may have a low degree of variation, whereas others are 
quite variable. 'T'he coefficient of variation is often a useful measure for yield 
characteristics, but is not useful for comparing the relative variation of two 
characteristics or two types of material since it is not independent of the loca- 
tion of the origin of measurements. 

If the number of pens, the amount of material, or the area of land is fixed, 
the number of replicates is automatically determined. Also, the available 
personnel and equipment may be the limiting factors in determining the num- 
ber of replicates. 

However, if one wishes to compute the number of replicates required to 
detect a specified minimum difference between means at a specified signifi- 
cance level for a fixed plot size and shape or pen size, several methods are at 
his disposal. It is required that an estimate of the experimental variance be 
available and that the size of the minimum difference of interest be specified. 
Making use of the well-known statistic 4, the number of replicates may be 
computed from the formula, 


2,25: 
B @Ш-27) 


where s? is the estimated experimental error variance. tei 

a percentage level for the degrees of freedom associated E iu dis a 
specified difference. Using the number of replicates, r, obtained ет aie: А 
(1-27), а difference between two means as large as or larger than а 
difference d will be detected in approximately 50 per cent of the cases ПАТ] T 
50-50 chance of detecting a specified difference is too low for most experi ' 
situations. A number of procedures which give greater than a 50-50 Pre 


гс —— 
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detecting specified differences are available for determining sample size [20, 
21, 60, 147, 192, 229, 277, 280, 285; 296, Ch. 18]. A theoretical discussion of 
the problem may be found in a number of references [147; 175, Ch. 12; 207, 
Ch. 6; 285]. Three of the procedures are described below. 


Ш-4.3.1 The Harris-Horvitz-Mood method. Harris ef al. present a 
method and the tables necessary for determining the sample size (number of 
replicates) required to obtain significance.in a specified proportion of experi- 
ments, given that a difference as large as or larger than some values of d exists. 
It is assumed that the observations within each population are normally dis- 
tributed, that all observations are subject to a common variance, and that an 
estimate of s? of the variance with df; degrees of freedom is available. Given 
the value of s;?, dfi, d (the specified size of the treatment difference), and the 
values of ’ in tables III-1 and III-2, the number of replicates required to 
obtain significance at the o per cent level with a probability of approximately 
y per cent when the true difference is d, is calculated from the following 
formula, 

г = 2(K's/d)*(df: + 1), (11-28) 
where df; is the estimated degrees of freedom in the experiment to be con- 
ducted and k’ is obtained from either table III-1 (y = 80 per cent) or table 
III-2 (y = 95 per cent), corresponding to the values of df; and dfs. If r is too 
small to give the estimated df», a smaller value of df; is used to find k’ from the 
table. The following illustrative example indicates the procedure. From the 
data of example 11-1 the experimental error variance is estimated to be 
s? = 141.6 with 40 = df; degrees of freedom. Let us assume that d — 20 
grams of fat is the difference which it is important to detect if present, that 
it is desired to detect a true difference of d > 20 in y = 80 per cent of the 
experiments, and that six treatments are to be included in a completely ran- 
domized design. Then, from equation (111-28) we find 


. 2(141.6)(.330)2(60 + 1) . 5 
d 400 , 


where it is estimated that df» is 60 and the value К = .330 is read from table 
III-1 for df, = 40 (actually for 32) and df, — 60. Since the number of degrees 
of freedom df: is overestimated, we try df: — 24, and 

r= 20410 81504 + D == 5 replicates. 


If a smaller true difference is to be detected in y — 80 per cent of the cases 


or if y = cent more replicates are required. у | Г 
ні ie imde has no definite estimate of der 
the form 512, but he knows something about its order of magnitu e. In i s 
to relate the information concerning the upper and lower limits of a stan + 
deviation into quantities like sı and df;, Harris et al. [147] propose a rather 
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-05 for one-tailed tests, .10 for two-tailed tests* 


TABLE III-2. Values of k’; « 


15 


permission of the editor of the Journal of the American Statistical Association from the paper by Harris, M., 


. M., “On the determi, ation of sample sizes in designing experiments,” J. Am. Stat. Assoc. 43:391-402, 1948. 
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i irst the experimenter is asked for upper and lower limits, 
pM eid d UN Suppose he estimates 7 per cent and 12 
be cent of the mean to be the lower and the upper limits of the standard 
deviation for a mean of 30. Then, sz = .07(30) = 2.1 and sy = -12(30) = 3.6. 
The estimate of the standard deviation is taken as the average of the two 
estimates, or 51 = (sy + sz)/2 = (2.1 + 3.6)/2 = 2.85. In order to find df, 
it is necessary to have the experimenter place some confidence on his estimates. 
Suppose that his confidence is such that he feels it would be a fair 4 to 1 bet 
(i.e., he would bet either way) that the standard deviation from the experi- 
ment will fall within the range 2.1 to 3.6 four out of five times (80 per cent of 
the time). In order to determine df,, we compute the values of Mx uu X? us 


/xi 
10 .90 
for the various degrees of freedom. Тһе value agreeing closest to the ratio 
Su/$r = 3.6/2.1 = 


1.714 will indicate the degrees of freedom associated with 


sı = 2.85. For the present example the value of м; X? 10(1247) /ҳ2 „(124/) = 
У 18.549/6.304 = 1.72 is closest. to 1.714. Therefore, we can assume that 51 ls 
associated with df, — 12 degrees of freedom (see table IIT-3). With these esti- 
mates, we are now ina position to determine the desir 


timate of o? is required and the true 
n, it is necessary to state 
before estimating the re- 


ba = NES (11-29) 


r= ga; (а 


licates, v = number of treatments, и; — L = treatment 
the type I error, and 8 = size of the 
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TABLE III-3. Square roots of ratios of chi-square for various degrees of freedom to use in determining 
degrees of freedom for estimated standard deviation? 
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pAn extension of the table (table III) by Harris, Horvitz and Mood [147]. 
Values estimated from Figure 11-4. 
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To illustrate the use of table 1-4, suppose that c? = 14, the design is a 
randomized complete block experiment, » = 5, a = .05, and the и; — p are 
—5, —4, 0, 3, and 6. From equation (IIT-30), 


r(86) 
*= 204 


which for r — 4 is 12.2856. From formula (111-29), 


o = 202289 L 9 a7, 


which is larger than the value of $(1.88) in table III-4 for df, = 4and df, = 12. 
If we set r = 3, the value for ¢ is 1.920, which is less than the tabulated value 
of ¢ = 2.066 [from 192] for df, = 4 and df; = 8. Hence, we would use four 
replicates and would detect significance at the five per cent level for the 
effects specified in about 80 per cent of all experiments, 

As a second illustrative example, suppose that c? = 141.6, X (u; — р) = 


400,1 — в = 80 Рег cent, а = 5 per cent, v = 6, anda completely randomized 
design is to be used. Forr — 5, 


А. | 2(5)(400) a 
Ф = 2(6) 141.6) = 1.534. 


The value of Ф from the table 11-4 for 5 and 24 degrees of freedom is 1.64. 
Since the calculated value is below the ta 


bular value of 9, it is necessary to 
try a larger value of r, For r = 6, 


= 41.2(6)(400) 
$- V2(6)(141.6) ^ 1-681, 


which is larger than the tabulated 
freedom. H i 


= r(8.0714), 


— u)/e < 2, the а = 10 per 
апу smaller leve] of signi- 
Ш-4.3.3 Tuk 
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TABLE IIIJ. Values of ¢ when the probability of detecting the falsehood of the hypothesis tested is 1 — B 
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the paper by Lehmer, 


Annals of Mathematical Statistics from 
15:388-398, 1944. 


permission of the editor of the 
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tables of probabilities of errors of the 
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confidence interval of length less than or equal to 2d with an assurance of 
y per cent: 


T = 2(5l ar /d) Fi. (dfs, dfi). ПП-31) 
The values of F;. (df; dfi) are obtained from table 11-8, and the other sym- 
bols are those defined in section IIT-4.3.1. The value for Lay, 18 obtained from 
«as, 18 for two means, some adjustment 


7 = 8120..ау,2Р,_„ (dfs, dfi) /@, (11-32) 


where 52, F_,(dfo, dfi), and d are defined above and 
value of qa for v treatments and df, degrees of freedom in table 11-1. 


То illustrate the procedure, suppose that. Tukey's hsd procedure (section 
II-1.1.4) is to be used to comp Ong ? — 6 means, Suppose 


that s? = 141.6 with df, = 40 degrees of freedom and that d = 20. The values 
i value of r, Suppose that the 


where qa,ay, is the tabular 


Then, df, = 30, and from table II-8, F,0(30, 40df) is 1.54, From table ПА, 
dos for v = 6 and df, = 30 is equal to 4.30. From equation (ПІ-39), 


r= (141.6) (4.30)2(1.54)/400 = 10, 


Since the value for df. was underes 


timated, tryr = 9; 
Fıo(48, 40df) = 1.48, and r = 


; then, df, = 48, qa = 4.2 
141.6(4.2)2(1.48) /499 = i 


uld have the assurance t| 
95 per cent confidence interval wi Aine 
per cent of all experiments, 
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Whenever a plot or pen is sampled, a loss in information relative to the 
information obtained for complete recording results. Ап experiment may be 
designed with the optimum size of an experimental unit and with the required 
number of replicates and then be rendered inadequate because of sampling 
procedures. Before selecting a sampling procedure, the experimenter should 
determine if the suggested procedure results in the required degree of precision; 
if not, the sampling rate should be increased or sampling should be dispensed 


with. 
Ш-5.1 LOSS IN INFORMATION DUE TO SAMPLING 

Yates and Zacopanay [335] present the general theory for sampling experi- 
mental units. Since the sampling is within the experimental unit, the number 
of treatments or blocks need not be considered. The variation among the 
experimental units, excluding block and treatment variation, is considered in 
relation to that among sampling units. Consider that we have n experimental 
units each composed of h sampling units and that a random sample of k 
sampling units or individuals is selected from each experimental unit. The 


analysis of variance follows: 


at Mean square 
iati 
Source oivana da Average value Experimental 
Among experimental units n—l1 У, + kV,’ MS, 
Among sampling units — 

within experimental units n(k — 1) ү, М5, 


The variance of а plot mean of k sampling units is given by equation 
(11-1) as У, + V;'/k. An estimate of V,’ is obtained from the difference of 
the two experimental mean squares divided by k, ог (MS. — MS,)/k. 
The efficiency of sampling [335] relative to complete recording is the ratio 
of the amounts of information from the two schemes, 
VV + V/k) _ Vol БУРЬ 
I, У) 7 Vy F Veh йг-зз) 


The fractional loss in information is one minus equation (111-33). In experi- 
mental work the loss in information, L, due to sampling may be estimated 


from the equation, 


(III-34) 


L=1- 


1 1 

i8. — MS) + MS, T ( v Э MS. 
1 Пора h 
) 3048. — MS) + ЕМ, 


MS; 


where the factor (h — k)/h is the ordinary finite population correction term 
[88, 330]. If the L values from several experiments are to be combined into an 
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average, a correction should be applied, since the L values are Bed 'The 
quantity L(n — 3)/(n — 1) — L' is unbiased and may be averaged. 

Yates and Zacopanay [335] summarized the results from a number of 
experiments on cereals. The sampling rates in their experiments averaged 
about 6 per cent of the plot. The loss in information due to sampling relative 
to complete harvesting was 31.2 per cent, or about one-third. 'This means that 
50 per cent more replication would be required for similar experiments in 
which 6 per cent sampling was practiced relative to experiments where com- 
plete harvesting of the experimental unit was performed. They show that 12 
per cent sampling would have resulted in a loss in information of 18 per cent 
and that 18 per cent sampling would have resulted in a loss in information of 
13 per cent. 

Ineach type of experimental work, it is necessary to obtain factual evidence 
concerning the relative loss in information due to sampling. Armed with such 
evidence the experimenter may then decide, in light of cost and practical 
considerations, whether or not to sample the experimental units. 


П1-5.2 DETERMINATION OF THE OPTIMAL SAMPLING RATE 


Yates and Zacopanay [335] presen 
sampling and in complete harvesting o 
care of the experiment from harvest с 
dollars, it is possible to estimate the о] 
be that it is less costly to obtain a specified degree of precision by having 
more replicates and sampling than to have fewer replicates and use complete 
harvesting. To determine the optimal i i 
the total work (or cost) of the experim 
tion is a constant; i.e., the quantity o 


=з 


5 (III-35) 


experiment (Q = one in the case of com- 
where r is the number of replicates in the standard 
= «= sampling rate, L = loss in informati 

5 » айоп аз 
84), and f = ууу (V + hV,’). The total amount of 


Q1-D -1- i.f 


where Q — necessary relative size of е: 
plete harvesting; Q — т/го, 
experiment), 5/0 = k/h 
defined by equation (I. 
relative work is 


r 
nO эг kC.) == aQ T bS, 


(11-36) 
where C, and C, are defined in e 


ti м 
other symbols are defined above, quation (III-5), а 
We wish to find 


!Note that th 
tion. Likewise, th 


= Cp b= hC., and the 
values of 0 and S which will minimize the total amount 
€ correction term is the reciproca] of the first. 
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of work for a fixed quantity of information. Therefore, we wish to minimize 


the quantity, 
r To( Vp! + V/k) | 
ск лут vm Ii 


which is equal to 


$= 0 + [Les «i- 1} (Ш-37) 
If we differentiate (III-37) with respect to Q, S, and А (the Lagrangian mul- 
tiplier), set the resulting equations equal to zero, and solve for Q and S, we 


find 
o= 1- tA ERE (1-38) 
5-0 Jo, (11-39) 
and 


5 an e Н oe [ИСАР (III-40) 


The values for S and Q as obtained from equations (III-38) and (111-39) for 
f = 03116, a = 1, and b — 2 (these are values obtained by Yates and Zaco- 


22. 20 — .03116) _ 
Qo = 1 — .03116 + one e ue s = 1.215. 


S, = 1.215/7.886 = .154. 
Thus, zo = -154/1.215 = 12.7 per cent or approximately 13 per cent, which is 
the optimum sampling percentage. The total relative work required is 
aQo + Ь% = Co = 1.215 + 2(.154) = 1.52. 

A procedure using any other sampling rate may be compared with the 
optimum sampling rate in a very simple manner. Suppose that we wish to 
compare 51/0: = 6 per cent sampling with the optimum. Then for Q, = 1.49, 
S, — .0894, and the total work is (fora = 1 and b = 2) Qı + 28 = 6 = 
1.49 + 2(.0894) = 1.67. The additional work necessary to obtain the same 
information with a 6 per cent sampling rate as with the optimum sampling 

ate is (1.67 — 1.52)/1.52 = 10 per cent. 
j E 3 ie AU [335] present a simple graphical method for obtain- 
ing the values Qo, So, Co, Qu, S1 and Cı. The first step is to plot the function given 
in equation (III-35) with Q as the abscissa and S as the ordinate (see figure 
III-2). The second step is to consider the total work in an equation of the form, 


0+8 =. (IT-41) 


panay) are 


oss in information is 31.2 per cent but the 


= ; rrect | 
= 149; {Ке со ent with the results by Yates and Zacopanay. 


- — .328 
iQ, = 1/0 — 328) = 149: em 


figure 32.8 per cent was 
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i aQ/b = 0. The line parallel to the line S + aQ/b = 
Я Rene ta by equation (111-35) is the line of minimum 
work. The broken line in figure III-2 which crosses the Q axis at 1.52 is the line 
tangent to the curve and parallel to the line S T aQ/b — 0. The values 
Qo = 1.215 and S, = .154 are read from the graph as the coordinates of the 
point at which the broken line is tangent to the curve. 
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the total work is C, — bS,/a + Q, the point at which the straight line ту 
to S + aQ/b = 0 cuts the Q axis gi отера 


'omparison and the use of 8 


tors affecting the 
re discussed by Yates a 


tive to А 
па Zacopanay [335]. complete harvesting 
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The above discussion refers to the random selection of k sampling units 
from the k units in the whole plot. Other sampling procedures such as the 
stratified random and the systematic (a form of stratification) are usually 
more efficient than samples selected completely at random from the entire 
plot. The loss in information with such procedures will be less than the loss 
due to random sampling from the entire plot [335]. Homeyer and Black [159] 
present some systematic sampling procedures which tend to decrease the loss 
in information due to sampling a given fraction of the plot relative to the loss 
in information suffered by random sampling. Also, some data on this are pre- 
sented by Yates and Zacopanay [335], who give some results by R. J. 
Kalamkar on the investigation of the variation among different sampling 
units in a field of wheat. 

П1-5.3 SAMPLING FOR ONE CHARACTER WHEN А RELATED CHARACTER 
OF THE EXPERIMENTAL UNIT IS KNOWN 

In field experiments on cereals the total combined weight of grain and straw 
of the plot may be obtained without difficulty. A subsample of the total plot 
produce is taken and threshed ; the weight of the grain — Y is observed as well 
as the total subsample weight of both straw and grain = X. Since Y is more 
costly to observe than X, i.e., threshing is involved, it may be advisable to 
make use of the total produce weight of the plot as well as X and Y in estimat- 
ing grain per plot. The methods for performing this operation are described 
under the topic known as double sampling [88, 330]. Yates and Zacopanay 
[335] show that appreciable gains may be made by using total plot produce, 
whereas Cochran [49] observed somewhat smaller gains. Double sampling 
procedures are well known in certain types of research but could be used to 
advantage in a number of others. 

The practice of making "eye estimates" to determine the total yield for an 
experimental unit has been in use for some time. Beef cattle have been ap- 
praised for quality by buyers. Legume content of pasture mixtures is often 
estimated by eye. Field plots are graded for population size of insects or for 
intensity of disease infection. That these “eye estimates" are useful will not 
be questioned, but they should be used only in conjunction with actual 
measurements or counts until their validity has been established. Otherwise, 


serious biases may result [61, 330, Ch. 6; 334]. 


CHAPTER IV 


The Completely Randomized Design 


The preceding chapters deal with subjects related to the design of experi- 
ments. The remainder of the text is devoted to topics related specifically to a 
particular design or to experimental designs in general. The concepts related 
to factorial experiments (see Chapters УП, VIII, IX) are extremely useful 
for the development of the analysis for several of the incomplete block designs. 
rs to an arrangement of treatments 
rial experiments is usually discussed 


ast squares method of estimati 
IV-1 Applications of the C 
IV-1.1 INTRODUCTION 
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are randomly arranged over the whole of the experimental material. No effort 
is made to confine treatments to any portion of the entire area, material, or 
space. The number of repetitions for any one treatment may vary. The com- 
pletely randomized design is usually selected when the variation over all of 
the experimental material is relatively small. 

The completely randomized design is considered to be most useful in 
laboratory technique and methodological studies, in some greenhouse studies, 
in experiments on certain animals, etc. As an example, the treatments may 
represent a number of different injections into chickens of a certain age. All 
treated chickens might be placed in the same pen. Likewise, certain treatments 
on sheep might be applied and all the sheep kept together in one flock. The 
treated animals would be marked in an easily distinguishable manner. This 
particular setup or experimental procedure would allow for relative equality of 


environmental effects upon all material within the experimental area or space. 


IV-1.2 ADVANTAGES AND DISADVANTAGES 
The chief advantages of the completely randomized design are 


(i) It is easy to lay out the design. 
(ii) The design allows for the maximum number of degrees of freedom in the error 
sum of squares. 
(iii) A completely randomized design has the simplest analysis of all experimental 
designs subject to statistical analysis. 
(iv) Unequal number of repetitions for the various treatments may be included 
without unduly complicating the analysis in most cases. 


The chief disadvantage of the design is that it is usually suited only for 
small numbers of treatments and for homogeneous experimental material. 
When large numbers of treatments are included, a relatively large amount of 
experimental material must be used. This generally increases the variation 
among treatment responses. If the variation over the whole of the experi- 

it is possible to select more efficient designs 


mental material is relatively large, ible ; 
than the completely randomized one, resulting in more precise estimates of the 
treatment means for the same number of replicates. Completely randomized 


designs are seldom used for field experiments because experience has shown 
other designs to be more suitable. 


IV-L3 LAYOUT OF THE DESIGN 
f experimental treatments on 


The term layout refers to the placement of- 
the experimental site whether it be over space, time, or type of material. The 
whole of the experimental area or material is partitioned into а number of 
experimental units, say N. А random selection of n experimental units is 
made and one of the v treatments is applied to these units. A random selection 
ОЁ га of the remaining № — r1 experimental units is made and one, any one, of 
the remaining v — 1 treatments is applied to these particular units. This proc- 
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ess continues until all treatments have been applied. When each treatment is 
replicated an equal number of times, DURATA ck eT, mm T and Dor: = rv 
=N experimental units. Unless practical limitations dictate otherwise orunless 
some treatments are more variable than others or are of greater interest, equal 
replication on each treatment should be made. . - 

Ап example of the layout of a completely randomized design for the five 
treatments А, B, C, D, and E replicated four times each is given below: 


Such an experiment might be designed for twenty pots on a greenhouse bench, 
8 series of twenty soil analyses, the twenty animals in a feeding trial, twenty 
cakepans in an oven, twenty successive bakings of single cakes in an oven, 
records on five litters of four pigs each, or other experimental material. 


IV-1.4 ANALYSIS FOR EQUAL REPLICATION OF THE TREATMENTS 
The statistical anal 


Source of variation df 88 
Among treatments (у – 1) Ухт X.2/ry 
i 
Among experimental units 
within each treatment Wied) > | Хг Б хыл 
"Total (corrected for 
the mean) (rv — 1) EEX XA 
TA 


where X;. represents the ith tr 
measurement, count, or 
ment, and X., 


— 
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e.g., Snedecor's F value, the standard error of a treatment mean, the coefficient 
of variation, the confidence intervals, etc. 


Example IV-I. A set of data involving four “ tropical feedstuffs” fed to chicks [275] 
was selected to illustrate the computational procedure and the calculation of various 
statistics for the completely randomized design. Although the details of the layout 
of the experiment are not given, the completely randomized design is assumed and is 
necessary to evaluate treatment differences in the manner presented. For this design, 
suppose that we have twenty chicks which are from a single uniform lot. We draw a 
random sample of five chicks to receive feeding treatment A. A second random sample 
is drawn for treatment B, and so on for treatments C and D. The twenty chicks are 
treated alike in all respects except for the feeding treatments. The differences between 
lots of five chicks represent only feeding treatment differences plus random sampling 
differences. If the four lots of five chicks each are from different breeds of chickens or 
are grown under different environmental conditions, the differences between the lots 
will contain not only treatment and random sampling components of variation but 
also a breed or environmental component of variance as well. If, on the other hand, the 
lot differences represent only feeding treatment differences plus a component due to 
random sampling from a homogeneous population, then the analysis of variance 
presented in table IV-1 is appropriate. First compute the totals X;., X»., Xz., and 
X4. for the four treatments А, B, C, and D. The treatment means 2;. are computed 


5 
next. The individual sums of squares Ьу treatments, > Х:2, need not be written 
jal 


down but may be obtained as a single figure in the calculating machine unless it is 
desired to compute the individual variances, s2, or to use them in checking computa- 
tions. The individual correction terms X;.2/5 also need not be written down but may 
be accumulated in the calculating machine. 

The sum of squares among feeding treatments is obtained Ь 
correction terms, (X12 + Xs? + X? + X42)/5 = 9592.2 + 
169886.2, and subtracting the correction term for the experiment, 
= 143651, or 169886 — 143651 = 26235. 

The sum of squares representing the 
obtained either by adding the four sums 0 


y adding the individual 
-.. + 101959.2 = 
X.2/rv = 1695?/20 


variation among chicks within treatments is 
f squares 742.8 + 3850.0 + 2093.2 + 4872.8 
—11558.8 or by subtracting the treatment sum of squares (uncorrected for the mean) 
from the total sum of squares (uncorrected for the mean), which is equal to 181445 — 
169886.2 = (181455 — 143651) — (169886.2 — 143651) — 37194 — 26235.2 — 11558.8. 

Before pooling the individual within lot sums of squares into a sum of squares and 
obtaining a generalized error variance, 52, such as 722.42, we should know something 
about the homogeneity of the individual variances. Some such test as Bartlett's chi- 
square test may be used to test for homogeneity [14; section 11-3]; thus: 


v v 
; — ; loges, IV-1 
2@ — 1df) = Zdi loge 5° à log.s (IV-1) 


which for this example is equal to 


2.3026 {16 logio 722.42 — 4(1од 185.7 + logio 962.5 + logis 523.3 


+ logo 1218.2)} = 3.49. 
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replicated an equal number of times, Шаки етага == 7 and Уг; = rv 
= N experimental units. Unless practical limitations dictate otherwise or unless 
some treatments are more variable than others or are of greater interest, equal 
replication on each treatment should be made. : | 

Ап example of the layout of a completely randomized design for the five 
treatments A, B, C, D, and E replicated four times each is given below: 


Such an experiment might be designed for twenty pots on a greenhouse bench, 
a series of twenty soil analyses, the twenty animals in a feeding trial, twenty 
cakepans in an oven, twenty successive bakings of single cakes in an oven, 
records on five litters of four pigs each, or other experimental material, 


IV-1.4 ANALYSIS FOR EQUAL REPLICATION OF THE TREATMENTS 
The statistical anal 
completely randomized design is present 


Source of variation df 55 
Among treatments (v-1) DXi2/r = X.2/ry 
Among experimental units 
within each treatment Wr i | Ух "i Xr 
Total (corrected for 
the mean) (rv — 1) EEX — X.2/ry 
US 


where X;. represents the ith tre: 
measurement, count, or 
ment, and X.. 
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of variation, the confidence intervals, etc. 


Example IV-I. A set of data involving four “ tropical feedstuffs” fed to chicks [275] 
was selected to illustrate the computational procedure and the calculation of various 
statistics for the completely randomized design. Although the details of the layout 
of the experiment are not given, the completely randomized design is assumed and is 
necessary to evaluate treatment differences in the manner presented. For this design, 
suppose that we have twenty chicks which are from a single uniform lot. We draw a 
random sample of five chicks to receive feeding treatment A. A second random sample 
is drawn for treatment B, and so on for treatments C and D. The twenty chicks are 
treated alike in all respects except for the feeding treatments. The differences between 
lots of five chicks represent only feeding treatment differences plus random sampling 
differences. If the four lots of five chicks each are from different breeds of chickens or 
are grown under different environmental conditions, the differences between the lots 
will contain not only treatment and random sampling components of variation but 
also a breed or environmental component of variance as well. If, on the other hand, the 
lot differences represent only feeding treatment differences plus a component due to 
random sampling from a homogeneous population, then the analysis of variance 
presented in table IV-1 is appropriate. First compute the totals X;., X»., Xz., and 
Xa. for the four treatments A, B, C, and D. The treatment means z;. are computed 


5 
next. The individual sums of squares by treatments, 2; X, need not be written 
PES! 


down but may be obtained as a single figure in the calculating machine unless it is 
desired to compute the individual variances, з, or to use them in checking computa- 
tions. The individual correction terms X;.2/5 also need not be written down but may 
be accumulated in the calculating machine. 

The sum of squares among feeding treatments is obtained by 
correction terms, (212 + Xa + Xs? + X42)/5 = 9592.2 4- 
169886.2, and subtracting the correction term for the experiment, X.. 
— 143651, or 169886.— 143651 — 26235. 

The sum of squares representing the 


adding the individual 
.-- + 101959.2 = 
2/rv = 1695?/20 


variation among chicks within treatments is 
obtained either by adding the four sums of squares 742.8 + 3850.0 + 2093.2 + 4872.8 
=11558.8 or by subtracting the treatment sum of squares (uncorrected for the mean) 
from the total sum of squares (uncorrected for the mean), which is equal to 181445 — 
169886.2 = (181455 — 143651) — (169886.2 — 143651) = 37794 — 26235.2 = 11558.8. 

Before pooling the individual within lot sums of squares into a sum of squares and 
obtaining a generalized error variance, s?, such as 722.42, we should know something 
about the homogeneity of the individual variances. Some such test as Bartlett's chi- 
square test may be used to test for homogeneity [14; section 11-3]; thus: 


v v 
1 = B A IV-1 
x2(v — 14) = >; df; log. s 2 df; loges ( ) 


which for this example is equal to 
2.3026 (16 logio 722.42 — 410810 185.7 + log 
+ logio 1218.2)] = 3.49. 


10 962.5 + logio 523.3 
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i i i i feeding mate- 
IV-1. Weight gains of baby chicks fed different 
AE Mis composed of tropical feedstuffs 


Feeding treatment 


Totals = Xi 1695 

Means = XC. 84.75 

I x 3 181145 

x2 169886.2 
145 

i хе, -x.s 11558.8 


Analysis of variance 
DEED E, 


Source of variation af 


at эз ms 
Among feeding treatments - 26055.2 8745.07 
Among chicks within 

treatments 16 11558.8 722.42 
eK 
Total 19 37794 
Correction term = X? о 1 143651 


Since x*(3df) > 3.49 has a rather la 


rge probability of occurrence (figure 11-4, Р = 32), 
we need not correct the chi-square 


value by the following formula: 
Хе = MM — (IV-2) 


1 1 1 
Шаку = ша Акту ыш Иш 
*39- Jb df zu 
which for the above example is 


3.49 144(3.49) 

а eee 
L+HEFE+ETE— wy ds = 216 

If the individual Variances, 


3 m 

comparing the treatment means, ži.. бап Square equal to 722.42 in 
Snedecor’s F test is used to test the hypothesi А 

mates of the same parameter, say р, © pothesis that the treatment means are esti- 


874: 
FG, 16df) = 790-55 = 12.1. 
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A value of 12.1 or larger with 3 and 16 degrees of freedom associated with the mean 
squares in the numerator and in the denominator, respectively, is exceeded less than 
one per cent of the time (Fo(3,16df) = 5.29 from table II-8) in random sampling from 
homogeneous populations. Therefore, we would reject the hypothesis of no difference 
among the means. Lacking any information concerning the nature of the treatments 
the means are considered to be a single set of means and are ranked from highest to 
lowest, and one of the tests presented in section II-1 should be utilized. 

Various statistics may be computed from the analysis of variance table (table IV-1) 
for the chick data. The standard error of a single observation, or the standard deviation, 


is the square root of the error mean square, A/S! = 4/122.42 = 26.9 = s. The stand- 
ard error of a treatment mean is s//r = sz = V/722.42/5 = 12.0. The standard 


T y 1 2(122.42 ү 
error of the difference between two means 15 $4 Jes Eis rm Al ( 22 -17.0-sà. 


difference is бвлвау5а4 = 2.12(17.0) = 36.0. The coefficient of 
26.9/84.75 = 32 per cent; to judge whether or not 32 per 
her knowledge and experience on experimental data 


The least significant 
variation is equal to s/f = 
cent is unusually large requires ot! 
of this nature. 


IV-LS ANALYSIS FOR UNEQUAL REPLICATION OF THE TREATMENTS 


Some experimenters are not always so fortunate as to obtain equal num- 
bers for each class. If the experimenter is working with animals, some of the 
experimental animals may become sick or die, leaving the experimenter with 
unequal numbers. In the laboratory, ап assistant may unwittingly bulk items, 
may forget to record the data, or may inadvertently lose some results in one 
way or another, and the experimenter is left with unequal numbers of indi- 
viduals, Also, it may be necessary to start an experiment with unequal numbers 
since the experimenter may wish to use all available material and does noe 
have equal amounts for the various treatments, The analysis e E 
numbers in a completely randomized design is little affected; the only rea 


effects are that comparisons among treatments with РЕ pu ae bs 
precise than among treatments with larger numbers an t р UN a rae 
slightly more difficult. The symbolical representation ү t Ў а: 
ance for a completely randomized design with unequal replicatt : 


ee E a c 


df 


У) Xi2/r, — х.г/ў 


М izi 


Among treatments 


35 Iii IP Xi? — хына} 


Within treatments ia 


Total (corrected for mean) xig eu 
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The symbols are defined in the previous sec 


tion; the number of replicates, ri; 
differs from treatment to treatment, 


Example IV-2. A variety, 109, of &uayule (a rubber-producing plant) was planted 
in a field. After the plants we i eee plant) was р 


1955 


99.7 26.4 91.4 
2 
115 250545 162652 11545 564321 
г 
LEM 560995.Т 149201.1 8374.1 518570.9 
s- 


8.0 


67566.7 


55185. 
2436.7 
5$5130.0 


5150.1 


113316.8 
451004.2 


Correction for mean 
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The analysis of variance for these data is presented at the bottom of table IV-2. 
The total sum of squares is obtained by squaring all individual weights and subtracting 
the over-all sum squared divided by the total number, 

o or 
УХ? — Х.2/У: 


їе] је] 
c2 
= 582 + 109 +--+ + IP H 657 — 5938 


= 564321.0 — 451004.2 = 113316.8 with 53df. 


The sum of squares among types with 2df is 
y Xi 4 ТЕЛЕ 3122? , 1496? P arm N 4935? 
del Te Улм 27 15 12 54 
= 518570.9 — 451004.2 = 67566.7. 


The sums of squares among plant weights within normals, offtypes, and aberrants are 
obtained from the formula, 


» Xi? — Xr 
j=l 


and are given in table IV-2. 
The sums of squares for the two orthogonal comparisons given in the anaylsis of 


variance table are 


3127 , 1496 4618 _ 436,7 _ [15(8122) — 2701499) 
27 15 42. 5 "7 97(15)(27 + 15) 
and 
4618: , 317 4935? — _ [12(4618) — 42(317) 
Ip EE erc) EA 42(12)(54) 


The standard error of a mean (table IV-2) is computed from the formula, 
DXi? — Xr 
2 7 s 
xm ri(ri — 1) 

The individual within-type variances were expected to be different. The means 
might be expected to be linearly related to the variances in material of this type. 
Bartlett’s test for homogeneity of variances results in the following chi-square value 
(formula IV-1 and IV-2): 

x? = 2.3026(51 logio 897.1 — 26 logio 1128.8 — 14 logy 959.4 — 11 logio 270.1) = 6.29; 


» 6.29 mo es 
Xe [teeta ta r) 1030 à 
with 2df. The probability of obtaining a chi-square value as large as or larger than 6.11 
occurs less than 5 per cent of the time in random sampling. Hence, it is concluded that 
the variances differ. The x2 value = 6.11 with 2df may be partitioned into 2 single de- 
gree of freedom comparisons with corrected chi-square values of 0.12 and 5.99 for 
comparisons among plant variances for normals versus offtypes and for the pooled 
among plant variances for normals and offtypes versus aberrants. Apparently the varia- 
tion in individual plant weights is much smaller for aberrants than for the other plant 
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types, but the variation among plant weights for normals and offi types is approximately 
equal. One could use the pooled within-plant variance, 1069.5 with 26 + 14 = 40 
degrees of freedom, for testing the difference between the means of the normals and 
offtypes, thus: 

2436.7 

ово 25 

The corresponding F value for 1 and 40 degrees of freedom at the 5 per cent level is 
4.08; we conclude that the difference in means of normals and offtypes could be ob- 
tained fairly frequently in random sampling. In view of the evidence the experimenter 
may not wish to assume that the error variances for normals and offtypes are estimates 
of the same parameter. He may have observed that the weight for the last offtype 
plant, 22, was unusually low and that for the last aberrant plant, 65, was unusually 
high. These values tend to increase the variances of both types. A copying error was 
suspected, but a check showed this not to be the case. A misclassification was suspected 
but could not be verified. In view of this, then, one may wish to assume that the 
variances are different. The means may be compared by using a ! test with the correct 
degrees of freedom. An approximate significance level of £ may be computed from a 


formula given by Cochran and Cox [60, р. 92; 273, p. 84]. The computed о per cent 
t value is obtained from the formula, 


tar —1d/83,? Ё... ars 
ta feta tet avs 
From formula (IV- 


3) the 5 per cent significance level for comparing normals and off- 
types is 


Tun fion 0594) 
"eed 21 15 


Wass = 2.11, 
27 15 
hich ii imatel ival i 3 
ee an ety equivalent to fos with 17 degrees of fi reedom. The experimental 
(2186-997 _ 159 


== == = 155, 
1128.8 959.4 1028 
21 15 
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The 5 per cent level of é for this comparison is 


214519594) 7 n 
= 2.16, 


959.4 , 270.1 
15 12 


los = 


and the 1 per cent level of t is 


n т 10612191) 


а = 959.4. , 270.1 


15 12 


The mean weight difference for the two types of plants, offtypes and aberrants, is much 
larger than could be logically attributed to chance sampling fluctuations. 
The coefficient of variation for the whole experiment has little meaning, but for 


illustrative purposes it is equal to 


The coefficients of variations for the three types of plants, normals, offtypes, and 
aberrants, are 


У1128.8 _ 99 per cent, 


115.6 


м 959.4 
7394 7 31 per ceni, and 


A/ 210.1 


TE T 62 per cent. 


The above computations have been carried through for illustrative purposes. In 
practice, one would select a transformation to stabilize the variances. In this case the 
individual variances, sj, are almost ten times the corresponding means, Ži. From 
equation (II-7), we note that the square root transformation js indicated. The square 
roots of the individual plant weights (table IV-2) are given in table IV-3. The means, 
variances, and the analysis of variance are presented in the table. The individual 
variances are remarkably uniform and appear to be unrelated to the means. This allows 
use of the generalized error variance in making the various tests of significance and in 


computing various other statistics. | ^ 


portant that the assumptions under! 
cance be fulfilled. If the us c 
that the error deviations (the devi 


means in the completely randomize d 
uted about zero with a common variance [100]. Cochran [54] discusses the consequences 
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when the assumptions are not true. If anormality is not serious, the usual tests of sig- 
nificance are little affected. Also, a transformation of the data to another scale of 
measurement may satisfy the requirements for making tests of significance [15]. 


Further discussion of the validity of tests of significance on data from experiments is 
given in Chapter II. 


TABLE IV-3. Square roots of dry weights of shrub in table IV-2 


Normals Offtypes Aberrants Total 

7.616 10.000 10.1h9 5.851 

10-440 10.817 9.165 3.464 

9.327 9.055 9.381 һ.!]2 

10.050 11.555 10.440 2.256 

10.247 15.115 11.576 6.98 

9.695 11.555 10.296 5.657 

12.995 12.845 9.274 4.583, 

11.87} 12.247 12.207 4.359 

10.583 10.770 8.000 1.899 

10.198 10.95} 10.954 4.472 

7-616 15.856 10.392 4.125 

9.899 10.817 9.055 8.062 

10.296 - 10.585 E 

10.954 - 11.358 - 

8.062 ч 4.690 Ж 
Totals, X, 287.322 147.520 59.086 495,028 
Means, x, 10.64 * 9.83 4ge 9.15 
К 1122 1896 317 4935 
2 
XP dn 2051.55 1450.81 290.95 4799.29 

2 _ yd 

Л 6.5 45.19 26.07 135.71 
2 


5.25 2.51 


Analysis of variance 


Source of variation 


Among types 


N vs 0 
N * 0 vs 2A - 
Within types 2 


Total 


Correction for mean 
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For this particular situation an additional line appears in the analysis of 
variance table, 
V uu c Uu m OS 


Source of variation df ss 
С Б 
=1 та oed 
Among treatments v >; SE ET 
Experimental error vlr- 1) pjr _ Хы 
TIP К tk 
; 14 Xy? 
Sampling error rv(k — 1) >> УХ — =e 
1 Е 
р. 
Total rvk — 1 »»»»: ELS 


а ЗЕК оъ E esce 


There аге k samples per experimental unit. Each treatment is applied to r of 
the rv experimental units. The other symbols represent sums of quantities. 
For unequal numbers the values r; and k;; vary, as denoted by the subscripts. 
The treatment degrees of freedom and sum of squares may be partitioned 
into meaningful components as in example IV-2. Likewise, other sources of 
variation may be partitioned provided that the partitions are meaningful. 

For certain types of experimental material the treatments form parts ofa 
hierarchy of items. The v treatments may be composed of s species, n; strains 
within each species, and rij individuals per strain. The analysis of variance 
table illustrating the sources of variation and degrees of freedom is 


df ss 


Source of variation 
Treatments v—1 
s—1 DXi-2/ri. — Х..3/.. 


Among species 
v-s xixxmn - XA. 


Among strains within species 1 


т.—У Yr|rxe - хазл 


Within treatments 


where r;. = Jory and r.. = XXe 


discussed by Snedecor [273], Cochran and 


3 
I f the above types are 
Ana [132], Kendall [179, p. 173-181], Mann 


Cox [60], Crump [78,80]. Ganguli 
[207], Wald [299], and Wilks [308]. 
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IV-2 Least Squares Estimates and Expectation of Mean 
Squares 


Tt is possible to use a variety of estimates of the population parameters in 
summarizing the data from a sample. The estimates will have various proper- 
ties, such as minimum variance, biased, unbiased, etc. Of the several estima- 
tion procedures, the least squares estimate of a population parameter is "best" 
in the sense (i) that the sum of squares of the differences between the observed 
values and the least squares estimate is a minimum and (ii) that among all 
unbiased estimates which are linear functions of the sample data, the least 
squares estimate has the smallest sampling variance. À number of other 


methods of estimation, maximum likelihood, moment, minimum chi- 
etc., are available and may or ma 


as the method of least squares. 


square, 
y not give the same estimate of a parameter 


Before obtaining the least Squares estimates and their variances for а 
completely randomized design, the least squares estimate, f, of the mean, и, is 
obtained. The least squares estimate of the linear regression coefficient ів 
presented after the sections on expectations of mean squares for completely 
randomized designs. 


ІҮ-21 SAMPLES OF n OBSERVATIONS 


IV-2.1.1 Least squares estimate of the mean, 
is designated as the parameter и and the le 
desired to choose such that the sum of 
minimum. Graphically, this may be represe 
which the tangent to the curve has a slope 


The population mean 
ast squares estimate as f. It із 
squares of the deviations from Risa 

nted as the point on the curve at 
of zero (figure IV-1). Borrowing a 


= 
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Equation (IV-4) is set equal to zero, and the solution for и is obtained, thus: 


Ф050 a3 0-9 = 0. av-s) 
Therefore, p= zx dy (IV-6) 


The symbol 2 replaces и whenever the n individuals observed represent a ran- 
dom sample of the total population. If all individuals in the population were 
observed, и could be calculated exactly, and there would be no need for esti- 
mating the population mean. 

From equation (IV-6), we see that the ordinary arithmetic mean is the least. 
squares estimate of the population parameter д. This makes the sum of the 
sample deviations, УХХ: — à), equal to zero. Also, the sum of squares of the 
observed values X; from = is a minimum. 

The mean, 2, is subject to sampling variation. If another random sample 
were drawn from the population and an estimate of ш obtained, it is highly 
improbable that these values would be identical. The redeeming feature is 
that the estimates fall within a calculable interval in a specifiable proportion 
of the cases. To calculate the interval, an estimate of the variance of the popu- 
lation is needed. By the method of moments, it is possible to calculate the 
sampling variance. 


IV-2.1.2 Variance of the sample mean. The first moment about 
the origin is defined as E(X;) = p, which is the expected value or average 
value of any randomly drawn element from the population under observation. 
The above agrees with the least squares solution for и; i.e., if all items in the 
population were averaged, the result would be д. The second moment about 
the origin is defined as E(X?), and the second moment about the mean is de- 
fined as E(X — и)? = o°, where о? is defined as the variance. In the normal 
distribution, p + « represent the points of inflection on the normal curve. 


Now the following is true: 
E(X — uy = E(X — 2Xy + ш) = EX? — EX + p 
= EX: — (EX) = о, (IV-7) 
expected value of a sum equals the sum of 


by use of the theorems (i) that the t 
the expected value of a constant is the 


the expected values and (ii) that 


constant. З 
be represented as the mean of the population plus 


The observation X; may Х е t ia 
some random deviation (plus or minus) in the form of a linear model, 
X: = p+ éi (IV-8) 


which is estimated by e; = X; — 2. 'The mean 
sented by the linear model, 
Le. (IV-9) 


n 


where e; is a random deviation 
of n observations may be repre 


=н + 
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By definition the variance equals 


Е[Х; — u} = Efu + e — и]? = E(e?] = o 2 = o. (IV-10) 


. Therefore, 
4 Jor- r] = af Exe 827] 
= DF + «P - lip + Бе] 
= п + ne? — ni? — cè = (n — Ye, (IV-11) 
if 


Ере] = 0 = Е[ее], j < i, and E[e?] = cè. 


The deviations are random independent variables. From equation (IV-11), 
we note that division by n — 1r 


esults in an unbiased estimate of the popula- 
tion variance e? = g2., 


"The variance of the sample mean is 


1 а 
= Elea t e ++] = = 


"Therefore, 


(IV-12) 
the variance of the sample mean is estimated from the f. 
formula: 


ollowing 
„9. M У(Х; tori zy 

б: = EE CREE = зг. (ТУ-13) 

ple may be viewed in an 

<= = an estimate of [a 


The variance of the sam other manner. 'The sum of 
squares due to the mean, has the average value 
Д у= ЕОС): ^ EG + e)P _ ny? + по 
Е(УХ)= E X Ty UR eR LE CR one, (IV-14) 
where 22 X; isthe ordinary correction for the mean. Th 
has the expectation: ны К 


ЕУ X? = DE + «у 


= nj? + под, (ГУ-15) 
"Therefore, the expectation of the resid 1 
mu map ual sum of squares after fitting the 
BED Ret Tyee eas (n. Dee, (IV-16) 
with n — 1 degrees of freedom. 
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IV-2.5 COMPLETELY RANDOMIZED DESIGN WITH AN EQUAL NUMBER 
OF REPLICATES PER TREATMENT 


For a completely randomized design, assume that the yield of an observa- 
tion is represented by the linear model, 


Xi = ui + є; = treatment mean + random deviation 
= p + (ш — u) bey = et Ti t es (IV-17) 


where и represents the population mean, т; = и; — и = additive effect of the 
ith treatment or the partial regression coefficient for the ith treatment, ei 
represents the random deviation of the jth member of the ith treatment, and 
i = 1, 2, +++, (0 = number of treatments). The subscript j = 1, 2, +++, r«(rs 
= number of replicates or individuals in the ith treatment) denotes the 
particular individual in a treatment. For the first situation, consider that all r; 
are equal. 

IV-2.2.1 Least squares estimates of effects. The partial regression 
coefficients to be estimated are и, ту, 72, ***, т. In order to obtain a unique 
solution, the restriction that the sum of the t; (I; is the least squares estimate 
of т;) must equal zero, 

E 
Èt: = 0, (ТҮ-18) 
is imposed, since we postulate that Ё[т;] = 0 in the population. 

The sum of squares to be minimized is! 


R-232(X;—B-—1U5 (IV-19) 


which is the within treatment sum of squares. The partial derivatives of 
equation (IV-19) with respect to д and the /; are set equal to zero; thus: 


aR = УУХ —a-—t)=0, (1V-20) 
aR _ зуу -а- 0) = 0, (ТҮ-21) 


OR _ У(Х — h — 0) = 0, (1V-22) 
j ^ 


E =- (Xu — A — te) = 0. (1V-23) 
91, 7 
Strictly speaking, the sum of squares to be differentiated should be 22 (Xi; — и — 74), 


a g ‚т, Ta es Toe The resulting equations are 
and the differentiation should Ре ye 55 edd "estimates of the parameters which satisfy 


th and £i, hi, ls ^ D 3 \ Nus 
oen шо P = operational standpoint, we use the estimates which minimize 
equation (IV-19). 
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Fractional replication of factorials may be combined with confounding. 
For example, a half replicate of a 2° factorial may be arranged in two blocks 
of sixteen treatments each [60, p. 192] or in four blocks of eight treatments 
each [173, p. 267]. Other schemes are described by Finney [121], and other 
arrangements are given by Kitagawa and Mitome [187]. 


IX-6 Missing Data 


If an experimental unit has been damaged or destroyed in a completely 
confounded incomplete block design of the type described in example IX-1, 
the yield may be estimated by the ordinary formula for a randomized com- 
plete block design (see Chapter V). Only the incomplete blocks in which the 
treatment appears plus the incomplete block in which the yield for this treat- 
ment is missing are used in the formula. These blocks replace the replicates 
in the formula; only the treatments appearing with the missing one are con- 
sidered. 

In partial confounding the situation is more complex [60, sec. 6.34; 175, 
sec. 15.8]. Cochran and Cox discuss three situations for partial confounding; 
i.e., partial confounding in the factorials of the type р"; partial confounding 
in factorials of the type k X p" X q”; and for a single replicate of the р" 
factorial. The formula may be obtained by differentiating the algebraic ex- 
pression for the residual sum of squares, setting the resulting equation equal 
to zero, and solving for the missing unit; or a covariance analysis may be per- 

formed with zeros used as the covariate of all values present and a one used 
as the covariate for the missing unit (see Chapters V, VI, and XVI). 


СНАРТЕК Х 


Factorial Experiments with Main Effects Confounded 
— Split Plot and Split Block Designs with Variations 


X-1 The Split Plot Design 


X-1.1 INTRODUCTION 

The very nature of the levels of one factor, say a, may be such as to 
exclude the use of small plots or units, or the experimenter may know that the 
levels of the factor usually differ in yield. In such circumstances the levels of 
factor a (а, аз, «++, а,_1) may be laid out in relatively large units (whole plots) 
designed as а randomized complete block, latin square, or other design. Since 
the whole plots are large by necessity or by design, it may be desirable to com- 
Pare levels of another factor, say b, on each plot, the several levels—bo, bi, ** *, 
be-1—being allotted to the split plots or sub-plols of each whole plot at ran- 

om. Such an arrangement is a factorial arrangement of the factors a and b 

with p and q levels, respectively, in which the main effect A, with p — 1 de- 
grees of freedom, is completely confounded with incomplete block or whole 
Plot differences, Thus, the split plot design is an ipcomplete block design. 

The whole plot treatments may themselves represent a factorial arrange- 
ment, say km X з"; the split plot treatments may also represent a factorial 
arrangement. The terminology “whole” or "split plot" treatment does not 
necessarily refer to the levels of a single factor, even though the explanation 
55 given in these terms (see section IX-3). ; 

Numerous examples of factors which require large experimental units are 
available in all fields of research. A few of these are listed below to illustrate 
the diversity of material for which split plot designs are suitable. 

Gi) In experimental education a movie film may be used by several teachers and 
on several sets of students; the film is a single experimental unit as far as 
replication on the film is concerned. Perhaps replication on films could be 
obtained by filming the material on different types of film under different 
conditions, by different operators and cameras, etc. . 

(ii) In greenhouse temperature studies, it may be necessary to keep the entire 
greenhouse at a constant temperature. Several treatments may be conducted 
in the greenhouse, but the greenhouse is used as a unit. Heat chambers, 
Storage cellars, freezing units, baking ovens, etc. must also be utilized as a 
Single experimental unit. 
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(iii) In research on milking machines a relatively large amount of milk is re- 
quired. Methods of cooling or pasteurizing require smaller amounts of milk 
and may be utilized as split plot treatments. 

(iv) In the preparation of metal alloys a smelting or blast furnace requires large 
amounts of material, whereas some treatments, such as types of mold, require 
relatively small amounts. 

(v) In palatability studies, one judge, the whole plot, is usually capable of testing 
several items, the sub-plots. Likewise, in rating material a single individual 
is capable of scoring a number of items. 

(vi) In crop tillage or harvesting studies a large amount of material is usually 
required. Hence, sub-plot treatments may be applied to each whole plot. 

(vii) Insecticidal or fungicidal research with ordinary farm operational machinery 


requires large amounts of materials; these may be subdivided into smaller 
lots for sub-plot treatments. 


(viii) In feeding trials the unit is an animal. The carcass may be subdivided to 
study methods of cooking, storage, tenderizing, etc. 
(ix) In baking studies the unit is the batch mixed at one time. The batch may 


be subdivided either before or after baking or both before and after baking, 
to study other treatments. 


(x) The smallest unit in some plant-response studies is a single plant, but the 
plant may be subdivided into subsamples to study methods of chemical 
analyses to determine plant composition. Likewise, to study plant response 
the whole plant is necessary in some treatments, whereas a leaf or half leaf 
is suitable for other treatments. 

(xi) The store or farm may be the unit in certain management studies, while 
methods of displaying or producing the commodity-may be compared within 
the store or on the farm. 


For certain types of material, such as the above, and for cases in which 
the experimental procedure is not clearly stated or understood, it is sometimes 
assumed that the p X q factorial is arranged in a randomized complete block 
design, when in reality the design is a split plot. Such a mistake results in in- 
correct estimates of the treatment variances and in incorrect tests of hypoth- 
eses. It is imperative that the conduct of an experiment be thoroughly under- 
stood in order that correct analyses may be made. 


X-1.2 ADVANTAGES AND DISADVANTAGES 
The advantages of the split plot design are 
(i) Experimental units which are large by necessity or design may be utilized to 
compare subsidiary treatments. 


(ii) Increased precision over a randomized complete block design of the pg treat- 
ments is attained on the sub-plot treatments and on the interaction of sub- 

т] plot and whole plot treatments. 
(ii) The over-all precision of the split plot design relative to the randomized com- 
plete block design of the pq treatments may be increased by designing the 


whole plot treatments in a latin square design or in an incomplete latin 
square design. 
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The disadvantages of the split plot design are: 
(i) The whole plot treatments are measured with less precision than they are in 
a randomized complete block design of pq treatments. 
(ii) When missing data occur, the increase in complexity of the analysis for the 
split plot design is greater than for the randomized complete block design. 


X-1.3 RANDOMIZATION 

The randomization procedure for the main or whole plots is determined by 
the particular design chosen. If a latin square or randomized complete block 
is selected, the experimental layout follows that described in Chapters V and 
VI. If one of the incomplete block designs described in Chapter IX is used, the 
randomization procedure is as described therein. Also, one of the more com- 
plex designs described in later chapters may be utilized. 

The sub-plot treatments are randomly allotted to the units within each 
whole plot. A different randomization is used within each whole plot. Alterna- 
tively, the sub-plot treatments may be designed in various ways to control 
variation among sub-plots within a whole plot. The particular design chosen 
will determine the randomization procedure (section X-1.4.3). 


X-1.4 ANALYSIS 
X-1.4.1 Whole plots in a randomized complete block design. For 
an experiment in which the whole plots are laid out in a randomized complete 


TABLE X-1. Experimental lay-out and analysis of variance for a split 
plot design 
Replicate II Replicate 111 
51092515002 
b 


Replícate I 


b, 


о 


Analysis of variance 


Source of variation 


Whole plots 
Replicate Soe 5 
An (a) (r-1)(p-1) = 6 
Split plots mo 
B = 
Ai (ъ) „ел = x 
Total раг-1 = 55 


4 = p levels of the factor a, and 3 = q 


Block. desira mth 3 аса 
esign with 3 = r rep? grees of freedom is as presented 


levels of the factor b, the breakdown of the de 
In table X-1, 
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From the above, v + 1 normal equations are obtained; thus: 


ea XL Ce ar e d) n (IV-24) 
i=l j=l 
T 
Ac Xy = rl, + rp, ПҮ-25) 
ј=1 
Xs. = Ху = rl + rh, (IV-26) 
X. = X = ri, + rp. (IV-27) 
Using the restriction that Dit: = 0, the least Squares estimate of the mean is 
= о = T. (ТҮ-28) 
Using this estimate of и, the remaining constants are estimated as 
LL c pa (IV-29) 
h=-p=n,-—¢ (IV-30) 
ACE A 
a img, og (IV-31) 
The variances of the estimates may be obtained by the procedure described 
in section IV-2.1.2, 


IV-2.2.2 Expectation of mean squares, Е 
the analysis of variance under two different models, Model I and Model II 
The nature of the data determines the appropriate model, For Model I it Че 
assumed that the treatment effects are fixed (i.e., E[;] =.) and thatthe д are 
random independent variates distributed around zero with a common Yabiniice 
For Model II, it is assumed that the treatment effects, 7;, and the є;; are ran- 


dom independent variates distributed around zero, Th i 
: 2 € variance ii 
сг? and the variance of the є;; is о; шет в 


under Model II are given i 


isenhart [100] describes 
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If Е(т2) = 07, E(e:7) = oê, Е(и?) = p°, and the expected value of all 
cross products is zero, the expectation of the sum of squares due to 2 = @is 


врх.) = PAZA рь У Xie) 
= гур? + ro? + оё. (IV-32) 
The expectation of the sum of squares due to the /; is! 


ЕГЬХ,. ЫХ... X] 
ед 


F 
- ^ E(ru + rri + (ei + 6s + *** + e] EX.2 
=> г n 
i=l 
= (v — 1)(е2 + ге.?). (1V-33) 


The residual sum of squares after fitting the constants и and t; has the expec- 
tation: 


d xxx pote Ух] 


УХ, 2 X.2 Es) 
r 


= OE Batata a SEV ES 
= гуш? + reo? + rvoe — (rep? + rio? + vo) 
= u(r — lee. (1V-34) 


The total sum of squares corrected for the mean has the expectation: 


gd rx. ч | = гш? + rea? + rio — rt? — ro? — сё 
ay 


TU 
= (w — 1)го,# + (rv — De? = (v — 1) (ё + го?) + v(r — l)e? (IV-35) 
= treatment sum of squares + within treatment sum of squares. 


The estimated variance components in an experiment are obtained from 
the various mean squares in the analysis of variance. The estimated variance 
components are subject to error variation. Tukey [294] describes the method 


for obtaining the variance of a variance component. k 
The above expectations were obtained under the assumption that Model 


1The sum of products rule used to obtain the sum of squares due to the f; regression 
coefficients is as follows: (sum of squares due to Ё and 1j) — (sum of squares due to A’), 
where £ and /; are the least squares estimates obtained from formulae, (ТУ-28) to (IV-31) 
and 2 is obtained by differentiating У(Х — ш)? with respect to p’, setting the result 
equal to zero, and obtaining the estimate, A’, which satisfies this equation. For the case 
of equal numbers, fi’ = 2. However, this is not the case for unequal numbers of observations 


or for non-orthogonal classifications. In the following, the above procedure is not always 
explicity stated but the reader should bear it in mind. 
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11 is appropriate. The expectations are changed to some extent if Model I is 
appropriate for the data. The expectations are presented below. 

Given the linear model described by equation (IV-17), let us assume that 
(i) the ej; are random independent variables from populations with zero means 
and that (i) the т; are completely enumerated and fixed. From the above 
assumptions, E(e;?) = её, E(z?) = тё, the cross product of any two єз has 
an expected value of zero, Ё(т;) = 7;, and E(e:;) = 0. We will also require 

? 


that the sum of the тёз equals zero, 2 ут; = 0, although this is not necessary 
i=l 

if we redefine т;. The present definition of т; is: the true treatment mean minus 

the true population mean equals the treatment effect т 

true treatment means is the population mean д. 

With the above restrictions on the linear model, we set up the following 
table of expectations of sums of squares: (This tabular form becomes increas- 
ingly useful as the number of classifications increases.) 

a с=с сс Cw 
y Coefficient of the parameters (Model 1) 


The average of the 


Sum of squares 


r PE P» 
DLX rv rv r 
(2) 5 Xi2/r rv v r 
(3) X..2/rv rv 1 0 
Among treatments = (2) — (3) 0 (v-1) r 
Within treatments — (1) — (2) 0 v(r — 1) 0 
Tia: i кы a a‘ 


"Therefore, the expectation of the treatmentmean Squareisg? 4+ —" — > E 
as contrasted to the expectation c4 + rg, obtained under Model т. 624 
IV-23 COMPLETELY RANDOMIZED DE 
OF REPLICATES PER TREATMENT. ON WITH UNEQUAL NUMBERS 


IV-2.3.1 Least Squares estimates of effects. 


ететан мшш For the case where the 
Жи = Ула Уга, 
Xi = nf + nA, (ven 
Xs. = role + гр, (Iv-38) 
ZUM = rd, x Toh. (IV-39) 
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Applying the preceding method for estimating р, the mean is estimated as 
x t SL Boc ы 
а= з= nh e tre (IV-40) 
This is not free of the £j. Now any class mean, д, minus the general mean, u, 
is r; An estimate of the т; is obtained as follows: 


щы э » (V41). 


Ti 


Summing the above v equations, an estimate of » may be obtained from the _ 
following equation: 


Xt: =0= (29) — 00; (IV-42) 


p= 15000) -2 qvas 
— arithmetic average of the treatment means. 


thus, 


IV-2.3.2 Expectation of mean squares. The expectations of the 
total sum of squares, correction term for the mean, treatment sum of squares, 


and error sum of squares are, respectively: 
? 


Dp» = xe | = Lr: +02 + сё), (IV-44) 
| y (5) y сш аг; (IV-45) 
dz x 24 =@= + (zx = Z : ye (IV-46) 
d 
^ gju- 


= 


= Ула + od + od) — Gc? + Dril? + P) 


i=l 


2 Xn - o). (IV-47) 


i=l 
ents are the same as those obtained above for 


Tf the r; = r, the above coeffici b 
er treatment. The expectations of the mean 


equal numbers of individuals p 


1; = 0 is used, the estimates #,’ will not be the same as those ob- 
ifference t; — t; is equal to the difference ¢;’ = t". If one 
differences between effects, the restriction Drit: might 


1Jf the restriction Jors 
tained in (IV-41). However, the d 
were dealing only with estimated 
validly be used. 
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squares under Model I may be obtained in the manner described in the last 
part of section IV-2.2.2. 


IV-2.44 EXPECTATION OF MEAN SQUARES IN HIERARCHICAL CLASSIFI- 
CATIONS 


IV-2.4.1 Three categories of variation. As described previously the 
treatment degrees of freedom may be partitioned into several parts. Suppose 
- that thev treatments are composed of k A classifications with n, B classifica- 
tions in the gth A class and that there аге rg: individuals within each of the 
B classifications. We assume the linear model, 
Хоз = uei + €gij = и (ро. — н) + (Hoi — ну.) + є0+ј 
= p + ay + Boi + є, (IV-48) 
where g = 1, 2, ---, В; i=1, 2, eee, n5 ј = 1, 2, - го; ор = шу. — H 
= additive effect of gth A class; бш = gi — и. = additive effect of ith B class 
in the gth A class; egi; = а random error associated with an individual; and 
и = population mean. Furthermore, assume that the expectation of cross 
products of different effects equals zero, that each effect has expectation zero, 


oe that Elag) = сё, E(B?) = o, Eleni?) = оё, 
я, k 
Ул = г, Уу уги = r.., and Ул = n. = v. 
rest rr 0-1 


With these definitions and assumptions the expectations of the various sums 
of squares are 


Coefficients of the parameters (Model II) 


Sums of squares 


ut са? op og 
ee 
k 2e gi 
a 2 >; Ха? К т.. г Ps 
халла г.. T. г п.=у 
УХ,./г. г.. г Tgi? 
x Š T Yu. s 
2 
Vises r.. Уау. УУ айу. 1 


From the above table of сое сі i 

cients the expectations of the vari 

h l ari 

of squares in the analysis of variance table are [132] RPT 
Source of variation df Average value of sum of Squares 

Among A classes К —1 


аы ые стин аай 
Among B classes Са ха. -i] [ег + fr. - оа 


withinA classes v —k (v – ke? + (r.. — XXn. ] og? 


Within B classes r.. — y 


— “ ч е е - 
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For an illustrative example of the above analysis the reader is referred to 
the data presented in tables 10.12 and 10.18 of Snedecor’s book [273]. The 
average value of the mean square for litters is (in Snedecor’s notation) 
с? + 1.00c;? and of the mean square for litter sizes is c? + 6.76072 + 13.5205". 

In the above analysis of variance, it should be noted that there is no 
appropriate error mean square for testing the A class mean square in an F test. 
The disagreement in the coefficients of oz? in the two mean squares from table 
10.18 [273] is not serious in this case, 7.00 vs 6.76, but it could be in certain 
cases. This is another undesirable feature of unequal numbers in the completely 
randomized design [270]. 

IV-2.4.2 Five categories of variation. A more complex classification 
is presented by the data of example 10.19 in Snedecor's book [273]. 'The linear 
model for this example is 


Xigus = p + a; + Bij + Yük + бу F ут 


N * 
where d= Lie af = Е оази M 


m = 1, 2, +++, rig) li = УХУ; гу. = 222 hei IE гуы NA 
j k 9 ko Tous f о зї \ 

and the totals of the X's are obtained Ьу summations over th Ais similar to 
those above for the r's. The effects a, 8, y, ô, and e are considered to be random 
independent variables with zero means; this corresponds to Eis 
II [80, 100]. However, a mixed form of Models I and II is probáblyguoreap- ^ ү 
propriate for these data. The sums of squares and expectations аг Ў 
in tables IV-4 and IV-5. 

IV-2.5 REGRESSION COEFFICIENTS AND THE INTERCEPT 


IV-2.5.1 Least squares estimate of the regression coefficient and 
The method of least squares for estimating the regression 
say X, and the intercept of the line from 


\ 


of the intercept. 
of one variable, say Y, on another, 


th i 2 
e equation, арх (IV-50) 


is discussed by Wilks [310, sec. 13:21 to 13:24] and by several other authors. 
Suppose that some variable is observed on n individuals randomly drawn from 
a population and that the measurements or counts are recorded as Yi, Ys, 
-++, Y„. Also, suppose that an additional observation is made on each of the 
n individuals; i.e., X1, Xs, +++, X». The X; are called the independent variates; 
they are known without error and are often selected values. This means that 
on each individual item a pair of observations, Y ; and X;, is made. If the paired 
data are plotted in the X Y plane, the n pairs of observations form n points or 
coordinates. The resulting plot is a dot or scaller diagram. Now, it is desired to 
determine the best-fitting straight line to the n points or coordinates in the 
sense that the sum of squares of the deviations of the observed values, Y, 
from the corresponding calculated value, Y, on the line will be a minimum. In 


s af Ted 
эт ау? Cr 
туту Ке 


"Supr. АЎ: x Bet 


suus зугт 


зләфәшоләї JO 5ўиәторууәод 


so1enbs Jo sums snorrea әү} 10у $193oure1ed jo 5}пәгәцуәогу `Р-АТ S'IS VIL 
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TABLE IV-5. Analysis of variance 


Source of variation Average value of sum of squares 


e-»xerbp-] hk 


Among B classes 
within A classes 


Among C classes Xi gee. ИЕ IE даю 
within B classes YU ETT 21455 i E LM p 


Among D classes 
within C classes 


Among Individuals 
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order to do this, it is necessary to determine the intercept, a, and the slope of 
the line, b. Since these values are determined to make the sum of squares of 
residuals a minimum in the sample, the partial differentiation is with respect 
to a and to b. The value for any randomly drawn value Y; may be expressed 


in a linear form in terms of the population parameters о and 8 or in terms of 
the sample values; thus: 


Yi=a+8X:te=a + bX; + e; (1V-51) 


where e; is a random variable and e; 
Y; and the calculated Y; 
estimates of a and B, 
of squares, 


is the deviation between the observed 
= а + bX; and where a and b are the least squares 
respectively. The proper choice of a and b makes the sum 


à. -fy- à. —a-— bX) (IV-52) 
a minimum. Now, B z 
ARs = a 8x] = -22(¥%:—a-0X)=0,  Qvsm 


and па =D = bx (IV-54) 
ог а = ӯ — bi. (IV-55) 
SEQ aD a xy, a- bX) = 0. (IV-56) 


1929,6, у BAY: mx a? X;. (IV-57) 


The two equations and two unknowns may be solved as follows: 
Multiply equation (IV-57) by n and (IV-54) by УЎХ, to obtain 
BEX? + aX; = 3X Yn (IV-58) 
[ЬУ X: + na = DS KDX (IV-59) 
Subtract (1V-59) from (ТҮ-58) and obtain 
Ма Хг — (УХ дг] = AUX; — OXY, (IV-60) 
The least squares estimate of the slope is 


be MEA УХ БУ, У(Х, 2)(Ү; — 9) 


and 


and of the intercept is 
= 9 — be = gy — g UK — 2 — g) 
а=9— 2 = 7 а A - т? (ТҮ-62) 


Тһе values 4 and b define the "best-fittirig" straight line to the n pairs of 
points in the scatter diagram. 


EE SR ay oe, 
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In order to introduce the reader to the solution of simultaneous equations 
by the method of determinants, the solutions of a and b from the two normal 


equations, 
by X24 a3 Xi = DAY; (IV-63) 


by, X; + na = Ys (IV-64) 


are obtained in the following manner: 
UX Y: УХ; | 
AE Ys n be. n$,X;Yi = УХУ Y; 
a xX? LX: nx? — (LAG)? 
LX: n | 
Z0; —-20:—9 (IV-65) 
XXX; — 2)? 


and 


and 
| iX? EX:Y: | 
УХ; x5 = LANAS = TXDTAY: 
rS XX | ni Xe — (УХ) 


| EX: n 


a fà xq — (Vi — 9. (IV-66) 
У(Х: - 2) 

ons the notion of a matrix and of an inverse 

for obtaining the variances of least 

x of coefficients is 


For more complicated situati 
matrix [116] is of considerable importance 5 
squares estimates. For the preceding example the matri 


DUX Ух: 
Ух; n 
and the determinant of the matrix is 
LX? DA: | = пуха OX) =D. (10-68) 
DX: n 


The inverse of the matrix of coefficients, 


(ТҮ-67) 


equation (IV-67), is 


21 
5 ZEX: 
iX? УХ i = D 
УХ; п - XD xXx? 
E = 
р 5 ES P xa sje avem 
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The predicted values of the dependent variate for a given X; in terms of 
the least squares estimates are given by the equation, 


¥;=a+ bX; =9+0(X; — 3). (IV-70) 


TV-2.5.2 Variances and covariances of theestimates. The variance 
of the least squares estimate of the population intercept о is E(a?) — «2. Now, 


ве = BEX _ соха рози, 


n пу Хг (XF 
-Е|.+в Es. (ха - Back) poeta Xu 
= 4+ a (Ze) + «(Жаы 2) + cross products | 
zd Dre (IV-71) 


The variance of the least Squares estimate a is E(a — a)?; the estimated 
variance is 


2 6X 
ё? = руз? i (IV-72) 


where 2.2 is the sample estimate of the population parameter o2 
Also, the variance of the estimate b is E(b?) — g 


EQ) = Ре а р - XXX E 


А 
= 0°, 


DEUX um 
Е У(Х — 2) (6 — Je à Cetus 3) 
# +0+ 20-260 = В be gar: (IV-73) 
Therefore, 
2 
EO- в = sree gi (IV-74) 
and is estimated by 
= ёё 
T == x 
The covariance of а and bis 
А reL X; 
E(a— a)(b — в) = oa = = = 5 (IV-76) 


and is estimated by 


—é2 
E E ay (IV-77) 
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The elements of the inverse matrix times 2.2 give the sampling variance of 
the least squares estimates a and b and their covariance; thus: 


(OS ae ёё 
дё = Bee = уңу сту (IV-78) 
ШОО О pL oou 
62 = туух meer О? 
and = xX, —ó L A; 
bas = = EDS aF ee 


The above use of the inverse matrix is extremely helpful in obtaining the 
variance of least squares estimates in the more complex cases. 

With these results in mind, it is possible to obtain the reduction in the sum 
of squares due to the intercept, а (in reality a regression coefficient), and to the 
linear regression coefficient or the slope, b. The reduction is the value of the 


quantity, a? Y: + b» X:Y;. 


It will be noted that the totals )) Y; and УХХ; Y; are the right-hand sides of 
equations (IV-63) and (IV-64). The total sum of squares, > Y?, minus the 


sums of squares due to the regression coefficients a and b, has the expectation: 


Bl xv: Ae xy, | = a| ve -O s] 


YBa + BX; + ey — *E(ne + BOX: + Da)? — EXIQG — | 
no? + BX? + ne? — ne? — FEQQ! cà — EE: — 8)*(eg! + 8) 


(n — De? — eg (Xi — 2)? = (n — 20e (IV-81) 
Hence, division of the residual sum of squares by n — 2 results in an unbiased 


estimate of the variance in the population. А és 
The expectation of the residual sum of squares may be obtained similarly 


for multiple regression, that is, for each observation Y; some related values 
Xin Хо, etc. are observed. Their effect on the variation in the Y; may be 
removed and the residual sum of squares may be obtained. For the case of two 
independent variables, X, and Хз, the sum of squares to be minimized is 
02106 — а — bXy — СХ), (IV-82) 


ll 


the th 1 tions are 
ЕИ" (ED DISPO (1V-83) 
aY Xs + 50 + ХА = 2X8 Y E 
(IV-85) 


аў Xz: + b» XuXs + EX? = УХА 
and the reduction in the total sum of squares due to the three estimated 


regression coefficients a, b, and c, is 
ap Y: + bo XuY: + сХе. (IV-86) 


The residual sum of squares has the expectation (n — 3)02. 
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Ү-1 Applications of the Randomized Complete Block 
|. Design 
У-1.1 INTRODUCTION 
If the whole of the exp 
geneous, it may be possible to Stratify or group the ma 


of fields. 115 flexibility a 
most popular of all designs, wi 


(i) Accuracy, This 


randomized q accurate than the completely 


t types of experimental work. The elimination of 
па 
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the block sum of squares from the error sum of squares usually results in a 


decrease in the error mean square. 

Flexibility. No restrictions are placed on the number of treatments or on the 

number of replicates in the experiment. In general, at least two replicates 

are required to obtain tests of significance (see chapters on factorial experi- 
ments for exceptions). In addition, the check or other treatments may be 
included more than once with little complication to the analysis. 

(iii) Ease of analysis. The statistical analysis is simple and rapid. Moreover, the 
error of any treatment comparison may be isolated and any number of 
treatments may be omitted from the analysis without complicating it. These 
facilities may be useful when certain treatment differences turn out to be 
very large, when some treatments produce failures, or when the experimental 
errors for the various comparisons are heterogeneous. 


The chief disadvantage of the randomized complete block design is that it 
is not suitable for large numbers of treatments or for cases in which the 
complete block contains considerable variability. 

Cochran [45, 47] found for experiments carried out at the Rothamsted 
Experimental Station and associated centers between 1927 and 1934 that the 
error mean square for a randomized complete block design was 60 per cent 
of the error for completely randomized designs. Assuming equal costs of con- 
ducting the experiments, it would require ten replicates of a completely 
randomized design to obtain an amount of information equal to that from a 


randomized complete block design with six replicates. 
У-1.3 LAYOUT AND ANALYSIS 

Using the same example as for the completely randomized design the five 
treatments А, В, C, D, and E may be included in each of the four blocks. 
The following diagram illustrates an experimental layout for the field, labo- 


ratory, or greenhouse: 


(ii 


2 


Ks ПИШЕ ОЧЫ UE 
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The breakdown of the total degrees of freedom is 


Source of variation df Mean square 
Among 5 treatments 4 T 
Among 4 blocks 3 R 
Remainder or error 12 E 

19 Er. 


"Total r 
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As is apparent from the analysis, 3 of the degrees of freedom are segregated 
from the error degrees of freedom for a completely randomized design. These 
3 degrees of freedom are associated with the sum of squares attributable to 
the differences among the means of the four blocks. 

For the general case with v treatments and r replicates the breakdown of 
the total degrees of freedom for a randomized complete block design is 


Source of variation df ss ms 
LM... lS 
Replicates г-1 3 X g/v — Х.2/ту R 
1 
Treatments y-1 Y Xr — X.2/rv T 
i 
Residual (r—1)(v—1) LEX = DXi — YXQquv +X.?/rv Е 
Total (corrected forthe* ry — 1 DEK X.2/rv = 
mean) inj 
Correction for the mean 1 Х.2/ту ст 


C———————————— ÁO PIT 


X;. = ith treatment total X з = ЛЬ replicate total, X.. = total of ry 
experimental units, and X;; — yield of the experimental unit from the ith 
treatment in the jth replicate. 

If an experiment has been conducted as a randomized complete block 
design, it is possible to determine the efficiency for the same experiment 
conducted as a completely randomized design [319]. The calculated variance 
for the latter design is obtained from the sum of squares for replicates plus 
the sum of squares obtained by multiplying the error mean square by the 
treatment plus error degrees of freedom and dividing by the total degrees of 
freedom. Symbolically, this is 
E- (treatment plus error degrees of freedom) E + (block degrees of freedom) R 

treatment + error + block degrees of freedom 


a b= 1+ (r— 1)(0 — D]E + [r — А 
[0-10 + (= 0 1+ (= 0] = љ-=тТ (V-D 


or variance. The effici i 3 
plete block design relativ. HD bd a copia] eH 
` design been used is 

(r—1)(—1)4-1 v(r—1)+1 ( 

i —(ro—r—»-2)(ro—»--3)E 

[(r-D(—-1)*3]E Б fore 4) DE 

where the coefficients represent th. i 
e ‚ 216 corrections for the difference in the de- 
grees of freedom associated with E' and E [126, and {наша (1. The 


(V-2) 
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increase in efficiency due to the use of the randomized complete block design 
rather than the completely randomized design is obtained from equation 
(V-2) minus one expressed in per cent. 

With reference to the suggested replicate shape and the suggested size 
and shape of experimental units within the replicate the reader is referred to 
Chapter III. The amount of replication required will depend upon the pre- 
cision with which the experimenter wishes to measure the treatment means. 
He usually has some idea regarding the coefficient of variation in the material 
under observation. Also, he has some idea of the size of the diference be- 
tween two treatments, which is of practical significance. From this infor- 
mation the approximate number of replicates to use may be obtained by one 
of the methods presented in Chapter ш. 

The layout, computational procedure, and efficiency of randomized com- 
plete block designs are illustrated in the following examples. 

V-1.4 ANALYSIS FOR ONE OBSERVATION PER EXPERIMENTAL UNIT 

The general analysis for complete “harvesting” of the experimental unit 
is presented above. In this case the total for an experimental unit is used; 
this results in a single observation per experimental unit. The computational 
procedure for a randomized complete block design with one observation per 


experimental unit is illustrated below for a particular example. 


Example V-4. Lathwell and Evans [191] present yield data from soybean plants 
for five treatments grown in six replicates of a randomized complete block design. 
The experiment was conducted in the greenhouse, using sand cultures in pots. The 
five treatments are LLL, LLH, LHH, HLL, and HHL, where L refers to a light appli- 
cation of nitrogen (20 parts per million), 


H refers to a heavy application of nitrogen 
(100 ppm), and the position of the letter: application, 1.е., first, 
second, and third dates. The additional treatme 


s refers to the time of 

nts, LHL, HLH, and HHH, would 

have resulted in a factorial arrangement of treatments, but they were not included in 
this experiment. Х 

The yield data and analysis of variance for the soybean experiment are presented 

in tables V-1 and V-2. The total sum of squares corrected for the mean 1s obtained as 

8.8? + 12.9? + «++ + 36.6 — 924.92/5(6) = 35675.03 — 28514.67 = 7160.36. The 

treatment sum of squares is (96.5° + 134.7? + 170.0 + 220.12 + 303.67)/6 — 9289/30 

= 4314.21. The replicate sum of squares is (16933? oes + 136.0°)/5 — 924.9°/30 

is obtained by subtracting the treatment, and 

thus, 7160.36 — 4314.21 


replicate from the total sum of squares; 
ER LE LS um of squares may also be obtained as the sum of 
ted values, £j. + 2.5 


— 466.44 = 2379.71. The error 8 
squares of the deviations of observed values, Xij from ехрес 
LÀ T Lx 

— $ = Xj ie. the error sum of squares equals AX (Xy — А. 

i i he treatment 
Snedecor’s F is used to test the hypothesis of по difference among t 
means; F = 1078.55/118.986 = 9.06 > Fa(4,20df) = 4.43. nica we ie 
reject ü i ce among the five treatment means. If it is not pos- 
MADE P should have applied one 


sible to partition the degrees of freedom for treatments, we 
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TABLE V-1. Yields of Soybeans in grams per pot 


plor 566.1, 
Treatment 4 4314.07 
Linear component T 
Residual 5 
Error 


20 . . 
Total 29 1160.56 
Correction for mean 1 28514.67 
Sa MORE 20 ET. 


AE c tiple comparisons test indicates 
that mean number 5, HHL, is significantly higher than 


of squares and 
range almost large enough to exceed the required value at the 5 Der cent level, 


[—6(96.5) — 134.7 — 170.0 + 4(220.1) + 4(303.6)]2 
S-60 + (1)? + (1 4 42 + 4) 


= (1211.1)2/499 = 3492.29, which represents a sizeable portion of the treatment sum 
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of squares. The coefficients c(N; — т), of the treatment totals, X;., are the deviations 
of the amounts of nitrogen, /V;, per treatment from the mean nitrogen application 
where (60 -+ 140 + 140 + 220 + 220)/5 = 156 = п; the deviations are coded by 
dividing by 16. Since the sum of the cross products is squared, and since the code 
number is squared in the denominator, no decoding is necessary; thus: 


90%; — DX: [XN DX}? 
XN; — п)? ENa (V-3) 


Where c is the constant used to code each Ni. 

Furthermore, the sum of squares for the other components, quadratic, cubic, and 
quartic, could be computed if so desired [273]. Since only three levels of nitrogen, 60, 
140, and 220,were used, it was deemed inadvisable to compute any of the curvilinear 
regressions for the particular range of levels explored. The sum of squares attributable 
to linear regression represents the largest portion of the total treatment sum of Squares 
(table V-2). The test of the hypothesis of zero relationship between nitrogen and yield 
is F = 3492.29/118.986 = 29.4 > Fo(1,20df) = 8.10. The mean square for residuals 
= the deviations from linear regression is not significantly larger than the error mean 
Square, since F = 273.97/118.986 = 2.30 < Fio(3,20df) = 2.38 (table 11-8). 

Е r —6(96.5) — 134.7 — 170.0 + 4(220.1) + 4(303.6) 

or the mean difference, 66) + (=I? + (=)? + e+ 4] 


= zua = 2.8836, the standard error is obtained from the formula, 
5 , 
$i = уту (V-4) 
where the k; are the coefficients of the treatment totals, s? — error mean square, and 
r = number of replicates. The standard error of the mean difference 2.8836 is 


0.53226. 


118.986 1 
/ 6 ECT + (—1?-4 (-1?+ 4 + zl 


-F- f significant 
Therefore, £ = 2.8836/.53226 = 5.418, and £ = F = 29.4. The number of signifi 
figures carried is larger than warranted by the accuracy of the data but is required 


to obtain agreement between /? and F. Ж 
The standard error of the mean difference between two treatments is s; = 


2(118.986)/6 = 6.30. The lsd = salos(20df) = 13.14. The coefficient of variation is 
5/2 = 304/118.986/924.9 = 10.91/30.83 = 35 per cent, which appears pae rather 
large. Since th licate mean square is less than the error mean square, the present 
design is fess often than a completely randomized design would have been. If it is 
desired to compute the efficiency of the randomized complete block design relative 
to the completely randomized design, formula (V-2) is used. For the data of table 


V-2, Ry — 466.44 + 24(118.986) _ 114.555. Therefore, the relative efficiency is 
estimated to be 


30 — 6 — 5 + 2)(30 — 5 + 3)114-555) > уор 95 " 
ЕЕ БҮЗ — 5 + 1)(118.98@ X роса 
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y-1.5 ANALYSIS FOR MORE THAN ONE OBSERVATION PER EXPERI- 
MENTAL UNIT 


For k observations, readings, or determinations per experimental unit the 
following analysis of variance table is appropriate: 


Source of variation df ss 

Replicates r—1 Eau — X..2/rvk 
Treatments v—1 халк — X..2/rvk 
Poors Ge) з PA HRE 
Sampling error д rv(k — 1) xr(Exw = Э) 

"Total rvk — 1 XXX = Х:.2/туК 


where the X;.., X.;., X..., and X;;. equal totals for the treatments, replicates, 
the grand total, and the experimental unit, respectively. The detailed compu- 
tations are presented in example V-2. 


Example V-2. The data presented in table V-3 represent the grams of rubber 
obtained from two randomly selected plants in a plot for each of the seven varieties of 
guayule planted in the five replicates. The allocation of the varieties to the seven 
plots in each replicate was random. The plot size was twenty-eight plants long by 
twelve rows wide, with 20 inches between plants within a row and 24 inches between 
rows, resulting іп a plot of (1/12)[(28 X 20) X (12 х 24)] = 4624' x 24/. The repli- 
cate size was 7 X 24' by 4624’ = 168' X 4624’. The shape of the replicate might not 
have been the most desirable. Plots six rows wide by fifty-six plants long might have 
resulted in a better replicate shape in this experiment. 

The sums, means, and sums of squares for the data in table V-3 are presented in 
table V-4. The results are summarized in table V-5. The mean squares are obtained 
by dividing the sums of squares by the appropriate degrees of freedom. 

In table V-5, two errors are listed, experimental and sampling. The experimenter 
may often be in a quandary as to which one to use [126, sec. 65; 47, p. 28-35]. The answer 
depends upon the hypothesis to be tested and upon the assumptions made about the 
data. If the worker wishes to confine his remarks to the particular five replicates used 
above, the sampling error is used for testing the variation among variety means. If, 
on the other hand, the experimenter wishes to make an inference about the true dif- 
ferences among the seven varieties from the random sample of five replicates, the 
experimental error is used. The last cited instance is the one of practical importance 
in most cases. 
^ ae gone error is larger than the experimental error, but not significantly 80. 

e variation of plot means from plot to plot after removing replicate and variety 


Tet 


TABLE V-3. Field arrangement of 7 varieties of guayule in 5 randomized complete blocks and weight (grams) 


of rubber for 2 randomly selected plants 


6 - 406 


5 - 595 
6.85, 4.94 


4 - 109 2 - hos 


6.65, 6.17 


1.46, 6.39 2.53, 6.95 


8 - 109 
4.07, T. 75 


10 - 05 11 - 406 12 - №16 
1.85, 6.44 4.06, 6.65 4.35, 5.85 


20 - hof 
2.59, 4.79 


18 - 416 17 = 150 


6.42, 4.72 


22 - 130 23 = 109 2h = 405 25 - 416 26 - 595 
4.43, 1.531 6.8%, 0.89 6.49, 8.55 5-41, 0.87 6.71, 6.67 
35 - 595 3h - 130 35 - 416 32 - 105 31 - 406 
5.82, 5.08 6.64, 5.92 0.48, 1.97 7.30, 4.19 8.11, 5.95 


2% x 7 = 168 ft. 


2.03, 5.08 


5 x 46 2/5 = 255 3/5 ft. 


27 - 407 
6.46, 10.66 


30 - 109 


1.35, 5.55 


?First no. = plot по. ; second no. = varietal designation, and last two numbers = weight of rubber in grams 
from the two plants. 
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TABLE V-4. Totals of plot yields and sums of squares 


"Total sum of squares with 69 df: 
(371.52)? 
2.06? + 6.12? + 2.53? + ... + 5.92? + 5.08? — a 
= 2287.489 — 1971.816 = 315.673. 
Sum of squares for replicates with 4 df: 


1 n 2 2 
ERECTA 8 (TL52)? L 1993.811 — 1971.816 = 21.995. 


Sum of squares for varieties with 6 df: 


2 n 2 
SI: - SESS ы (81.523 = 2042.747 — 1971.816 = 70.931. 


Sum of squares of plot totals with 34 df: 


2 T 2 3 
7.85: + 11.80 = + 10.90? — CT 52) = 2148.102 — 1971.816 = 176.286. 


Sum of squares for interaction of replicates and varieties (by subtraction): 
176.286 — 70.931 — 21.995 = 83.360 with 24 df. 


Within plot sum of squares: 


- .82 — 5.08)? 
(6.12 ZO е z ) 


(6.12 + 2.06)? 
2 


= 6.12? + 2.06? — + +++ + 5.821 + 5.08? — 


= 2287.489 — 2148.102 = 139.387 with 35 df. 


(10.90)? 
2 


TABLE У-5. Analysis of variance for the data of table V-3 
Source of variation 

Replicate 

Variety 


150, 406, 595 vs 
Among 5, hor, 


Experimental error 
Sempling error (between Plants) 


405, hoy, 4: 
vig» MoT, 116 
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effect is zero in the population, it would be expected that the experimental error would 
be smaller in about 50 per cent and larger in 50 per cent of the samples. If the latter 
error is significantly smaller than the sampling error, it would be concluded that a 
significant negative intraclass correlation [127, 273] exists. The explanation would 
depend upon the particular type of biological material involved. 

Even though the experimental error is the smaller of the two variances, it is the 
best estimate of the error term for testing the significance of the difference among 
treatment means. The experimenter may wish to be more "conservative" and to use 
the sampling error and the degrees of freedom associated with the experimental error. 
Other schemes could be followed, but the most logical one is to use the experimental 
error as the estimate of error variation in making various tests of hypotheses, since 
this is not a result-guided procedure. 

The F test of the differences among the seven treatment means is 

_ 118218 
.. 8.4133 
For 6 and 24 degrees of freedom the F values at the 2.5 and 1 per cent points are 2.99 
and 3.67, respectively. 

The next question of importance would be to determine which, if any, of the seven 
presumably unrelated varieties are significantly different with respect to yield of rubber 
at the end of one growing season. To make these comparisons, several tests are sug- 
gested in Chapter II. Tukey's test (sec. II-1.3) indicates that variety 416 is significantly 
lower than the others in yield of rubber and that the variation in yield among the 
remaining six varieties might logically be ascribed to chance. 

The methods in Chapter II are applicable to a group of unrelated varieties or treat- 
ments. In this case, considerable information concerning the relationships of the seven 
varieties was available from past experiments. Variety 109 is the only 54+ chromo- 
some variety in the group; the remaining are in the 72 + category. A logical comparison 
is the mean of the 72's versus the mean of the 54-chromosome variety: 

. — 57.09]? 


[6(51.12) — 59.16 — 50.38 — *- 
1036--1+iti+it+1+) 


51.122 , (69.16 + +++ + 57.09)? _ 371.52 _ 0,4456. 
10 ar 60 70 


Also, it is known that varieties 406 and 130 a 
freedom among these three means could logically be 
Brees of freedom representing the comparison of thie 
variety and the comparison between the selections; 


Among 130, 406, and 593: 
59.16? + 66.60? + 57.09? _ 182.85 _ 5.0026. 


= 3.40. 


are selections from 593. The 2 degrees of 
partitioned into two single de- 
two selections with the parent 


130 + 406 vs 593: 
[59.16 + 66.60 — 2(57.09)]° к 
10(1++1-+4 


(59.16 + 66.60 + 57.09)" _ 2,2349. 
= 30 


57.09? (59.16 + 66.60)? 
10 + 20 
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130 vs 406: 
(59.16 — 66.60)? 
10(1 + 1) 
Furthermore, varieties 130, 406, and 593 are phenotypically different from the 
remaining three varieties, 405, 407, and 416. The former have round greenish leaves 
and short branching habit, while the latter group have long serrated grayish-green 
leaves and longer branches. A logical comparison would be between the means of the 
two groups, 
(59.16 + 66.60 + 57.09 — 50.38 — 55.46 — 31.71)? = 34.2015 
10 +1+1+1+14+) ез: 
The remaining 2 degrees of freedom make up the comparisons among the three 
varieties 405, 407, and 416, with'the following sum of squares: 


50.38? + 55.46? + 31.71? — 137.55? 
10 oS Serr vies 31.2813. 


= 2.7677. 


It was not known what relationship existed among the three varieties, and without 
this information the partitioning of the variety sum of squares is finished. Tukey’s 
test (section IJ-1.3) may be applied to these three means, resulting in two subgroups, 
405 and 407 in one group and 416 in the other. The sums of squares are summarized 
in table V-5, and as a partial check they should add up to the total 70.931. 

F = 9.85 exceeds the tabulated F at the one per cent point, and F = 4.50 exceeds 
the F value at the 5 per cent point. The means of the three varieties, 130, 406, and 
593, and of the three varieties, 405, 407, and 416, cannot be considered as coming 
from the same general population. Upon examination of the latter three varieties, it 
is found that they do not represent a homogeneous group and that the very low yield 
of variety 416 accounts for the large F values in both instances. 

The amount of variability relative to the mean in this experiment was much higher 
than desired. The coefficient of variation is 

3733/2 1.318 _ 
узату = $307 ^ 25 per cent. 


The standard deviation per plant mean yield is \/ 3.4133/2, which equals the standard 
deviation resulting from an analysis of the plot means. The verification that division 
of the error mean square by two (equals number of items from each plot) results in 
the same value as that obtained from using the plot means in the analysis is left as 
an exercise for the student. 


The efficiency of this design relative to a completely randomized design is estimated 
to be 
21.995 + 3.4733(6 + 24) 


= 4+ 6 + 24 3.7116 

Ef = 

Wie 3.4133 3.4733 101 per cent, 
от a gain in efficiency of 7 per cent. 


V-L6 UNEQUA 
AND UNEQUA 


i NUMBERS OF OBSERVATIONS PER EXPERIMENTAL UNIT 
REPLICATION PER TREATMENT 


Missi i ; ; 
mental HEN а units or unequal numbers of units within the ехрегі- 
еп create analytical difficulties. Regardless of precautions 
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taken, disproportionate results are occasionally inevitable. The experimenter 
may start with an equal number of replicates or with experimental units of 
equal size, but some of the animals may become sick and die; the technician 
may accidentally omit or mix up some of the results; a part or all of a field 
plot may not germinate or may be cultivated out; or any of several other 
things may happen to a part or all of the experimental unit. When faced 
with such results, the experimenter would still like to obtain all the informa- 
tion possible from the remaining observations. In response to this need, 
statisticians have developed several analytical procedures for handling dis- 
proportionate results from experiments [1, 17, 18, 33, 111, 139, 164, 189, 240, 
276, 316, 318a]. 

Before discussing the various m: 
out that the method of calculation 
than is present in the data themselve: 


ethods and situations, it should be pointed 
does not contribute any more information 
s. In other words, when a "missing plot" 
value is computed, no new datum is added. The procedure is merely a calcu- 
lational dodge for circumventing a more complex procedure. 

Also, some computational procedures are exact, while others are approxi- 
mate. The investigator must know the assumptions underlying a particular 
procedure [54, 100, 175, 276, 290] before he is assured of the correctness of the 


procedure. 


V-1.6.1 Missing experi 
the first to present a formula for co 
extremely divergent value for a randomi 
[316] showed that their [1] formula resulted in minimizing the error sum of 
squares. He presented an iterative procedure for calculating the values for 
several missing experimental units. The validity of the analysis of variance 
procedure was investigated, and it was shown that there is little disturbance 
provided that the proportion of missing values is not large and that the number 
of degrees of freedom for the ordinary randomized block experiment is reduced 
by the number of missing plot values computed. The expectation of the treat- 
ment mean square is too large in that the coefficient for the error variance 
component for the error variation is larger than one [111, 316]. This means 
that the F ratio is too large relative to the correct ratio and too many signi- 
cant results are obtained. It has been shown that the correct mean squares 
may be obtained without too much difficulty [111, 316].! 

The value of a missing experimental unit from the ith treatment in the 


jth replicate is computed from the formula, 


mental units. Allan and Wishart [1] were 
mputing the value for one missing or 
zed complete block experiment. Yates 


vX;. + rX.j = XS 
Ka ee DG (V-5) 


where the X;. equals the total of treatment i in the (r — 1) replicates in 
which it is present, X.; equals the total of the (v — 1) treatments in the jth 
replicate, and X.. equals the total of the rv — 1 observations. The applica- 


‘The treatment mean with the missing plot is equal to X; = (Xi. + Syr. 
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tion of the above formula is illustrated in several places in the literature 
[e.g., 1, 60, 273, 316]. 

Formulae have been developed for various numbers and arrangements of 
missing plot values [14, 17, 18, 111, 189] and are useful in special cases. Use 
of these special formulae often results in a considerable saving of computa- 
tional labor, but if only one procedure is to be followed, the iterative proce- 
dure proposed by Yates [316] is recommended. If more than one value is 
missing, we guess-estimate the values for all but one of the missing units; 
this one is computed from formula (V-5). Then, we use formula (V-5) to com- 
pute the value for one of the other missing values. This procedure is continued 
until the computed values become stabilized for each of the missing units. 
Usually, three cycles will suffice, but the number of cycles depends upon the 
closeness of the guess-estimates to the computed values. The procedure is 
illustrated by Yates [316] and Snedecor [273]. 


putational analysis is illustrated by them [276] 


and by Snedecor [273]. nd upon 


the assumptions about 


9, 276]. In the first procedure 
т each experimenta] unit, and 
means. The second procedure 


the means per individual or unit are obtained fo: 
the analysis of variance is performed on the 
involves obtaining expected subclass numbers and completing the analysis 
utilizing the expected values. Crump [79] has obtained the expectations of 
the mean squares for these two approximate methods. 

Another approximate procedure is to observe th 
the smallest number of units, say kj. 


ИР V-L63. Other Situations, It may happen that the yields of two or 
Te experimental units are available in tolo but not individually. The ex- 
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perimenter may unwittingly bulk the yields from two or more treatments 
before weighing, but is able to obtain the total weight of the bulked items. 
Bose and Mahalanobis [33] provide formulae for computing the experimental 
unit yields making use of the combined weight of the mixed-up experimental 
units. Ап application is made in their paper. 

In other situations the experimenter may be short of material for one or 
more treatments but has an excess of material for other treatments. The 
experiment may be designed with “missing plots” for some of the treatments 
[164], and the “missing plots" filled in with the excess material of the other 
treatments, The missing experimental units are designed into the experiment 
atrandom. One analytical procedure is to run an analysis on only the replicates 
for which all treatments are present, and a second analysis on only the 
treatments which are present in all replicates. The procedure does not make 
use of the substituted plots. Pearce [240] presents a procedure for utilizing 
all experimental units and illustrates the method with an example. In the 
same paper, Pearce [240] presents a procedure for analyzing an experiment 
in which one treatment appears twice in one replicate but does not appear in 
the second replicate, with the reverse situation being true for a second treat- 
ment. Àn interchanging of the treatments in this manner occasionally occurs 
inlaboratory work. The analytical procedure is not difficult and is recommend- 
ed for experiments in which the treatments have been interchanged. 

In certain experiments a treatment is applied in sequence to an experi- 
mental unit. The experimenter may inadvertently apply the wrong treatment 
at some stage in the sequence. Grundy [139] developed the statistical analysis 
appropriate for analyzing data from an experiment of this sort. 


V-2 Least Squares Estimates and Expectation of Mean 
Squares 
V-2.1 ONE UNIT PER EXPERIMENTAL UNIT 
If a single observation is made on each experimental unit of a randomized 
complete block design and if the effects are additive, the linear model, 

Ху = nd ri e; Gi, (V-6) 
may be assumed to represent the yield of any plot, where y represents the 
population mean value, т; = effect common to ith treatment, p; — effect 
common to jth replicate, and e;; is a random error component. i = 1, 2, +++, v; 
j= 1, 2, +++, п. The above linear equation may be put in the form of a mul- 
tiple regression equation if written in the form, 

Xy = p + тА + Хз + es, (V-7) 
where X,’ and Х have the values of zero or 1. X; takes on the value 1 in all 
cells of the two-way classification where 7; is present and zero elsewhere; 
likewise, Хз, has the value 1 in replicate j and zero elsewhere. 
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The least squares estimates of p, ri, Ta ***, To р, Pa °° *, Pn (п = number 
of replicates) are obtained as before; i.e., by differentiating the residual sum 
of squares with respect to the variables and equating the results to zero. The 
sum of squares to be minimized is 

o -n 
Эха inr) = В. (V-8) 
Tel j=l 
Partial differentiation of equation (V-8) with respect to р, і, and r; results 
in the following system of equations: 


aR 


m7 У(Х; — д-р) = 0; (У-9) 
oR 

3; -22 (X; === п) = 0; (V-10) 
oR А 

an 2 (Ху—а— Ь—гў = 0. (V-11) 


A, ti, and гу, the least squares solutions of the equations, make the residual 
sum of squares a minimum. The set of differential equations leads to the 
following normal equations: 

Equation for à: 


УУХ = X.. = nk + Èr + тй. (V-12) 


Equations for treatment effects h, +++, l»: 


УХ; = ХА. = nh + Dory + пй, (V-13) 

j С 
УХ = Xo. = nh + Dori + пр, (V-14) 

j J 
УХ; = Х,. nt, + Dory + па. (У-15) 

3 1 

Equations for replicate effects, г, T% * **, гь: 

Xa = Dt: + or + vp, (V-16) 
Xa = Dili + ore + op, (V-17) 
Xen = Dhi + vrn + ор. (V-18) 


In order to obtain и 


Ў nique soluti i 1 
coefficients, the followin q utions for the v + n + 1 partial regression 


8 restrictions are imposed: 


Di = 0 = Dry. _ (Y49) 
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Since Er; = 0 and Ep; = 0 in the population, this is a reasonable restriction. 
Now, the least squares estimate of the experimental mean is 


1 ХЕ. 
й = ank — Dr; + Х.) = йр T 1, (V-20) 
of t; is 
Ху 4.) NES А 
= = a — 2, (V-21) 
and of r; is 
nu gor (V-22) 


The variance of is 


вр = Би ү 


2 7 + onis + ы + Dzs | A 


e (V-23) 


Бао 

Сас (0 cium 

if it is assumed that the r= Do; = 0. 
The variance of any /: is 


El — т) = ES - Hi — E(r?) 


n 


ny + пт: + Уру + Lei ny EnEn t vo; zr« 
ja 4 п ES, nv — E(v?) · 
Da ‚ (VM) 


2 2 — 
= Er)? + — Z — (тё) = e. р 


2 
T b . cèln— 1) A 
In a like manner the variance of any г; 18 шше RS the covariance of any 


tit; (ii) or of any гуту, G ғ J’) is —c2/nv, and the covariance of lir; liĝ, or 

of гуй is equal to zero. 
Instead of the variances or covariances 'of the least squares estimates 

the expectation, or average value, of a sum of squares after fitting certain , 


constants may be desired. For this case, it is assumed that the v; ру, and 
ej are random variates from populations with mean zero and variances 
equal to E(r?) = от, Elp) = сь and E(c) = сё, respectively [79, 111, 
155, 169, 175, 290]. In obtaining the expectation or average value of various 


mean squares, the restriction that DET = Ур; = 0 is not imposed, but the 
Y = 0 = Dor; still holds. 
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The expectation of the total sum of squares after fitting the estimated 
constants П, t;, and г; is 


B( ZEX? - ax. - xx. Ун) 
= (xxx E IA LEX X2 


v nv 
= [ж + rit pst 6)2— re tabs + mietit) 


EE eeren heu t €j)* 
v 


7 
petn tt т) E v Pn) t єп d «J| 
ni 
= no? + nio? + noe è + nve? — пли? — nvo,? — 002 — vo? 
— nop? — no? — noo? — no? + nop? + no? + vo ? + сё 
= (nv 0 = п + 1)сё = (n — 1)(@— 1). (V-25) 


The expectation of the sum of squares due to fitting the і; only is ob- 
tained as the difference of the expectations for the residual sum of squares 


after fitting the constants д' and г; and for the residual sum of squares after 
fitting р, t;, and rj; thus:! 


gx: ерх. EE) (ena = ax, Жог; _ Dike) 
= EEGX.] = p| Z% e 4 


п nv 


= пу? + noo? + 00,2 + ve? — пон? — no? — to — с 
= (o — 1)(02 + ne), (V-26) 


which is the expectation of the treatment sum of squares. Likewise, the ex- 
pectation of the replicate sum of squares is 


DX M АШ? 
ol 22 X2] ред (V-27) 
and of the total sum of squares is 


X.2 
| xxx; = | = п(0 — l)e? + (п — 1)o,2 + (nv — Yok. (V-28) 


It is not necessary to assume that the т; and (or) the p; are random variates. 
If the т; are assumed to be fixed effects, then Er? = 72 x c, and Yn = 0. 


The expectation of the treatment sum of squares is (v — 1)e? + IT 
os ia bea more appropriate form of the expectation for the treat- 
sum of squares in many experiments than is equation (V-26). Like- 


!See footnote on page 103. 


——— 
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wise, the expectation of the replicate mean square for fixed p; is obtained as 
а? + эў p7/(n — 1). 
J 


V-2.2 k UNITS PER EXPERIMENTAL UNIT 
If k observations are made on each cell of a two-way classification, the 


linear model, 
Xin =B + Ti + pi + тру t е (V-29) 


may be assumed to represent the yield of any observation, where џи represents 
the population mean value, r; = effect common to the ith treatment, p; = ef- 


fect common to the jth replicate, 7p;; = ап effect common to the ith treatment 


in the jth replicate, eijn = а random error associated with the ijhth observa- 
tion, i= 1, 2, +, uj = 1,2, wee, n; and h = 1, 2, +++, k. 


The least squares solutions of ш Ta pj, and тр:; аге obtained as before; 


thus: 
aR _ y YXXQa-hR-ü-n-n)-06 (080) 
on T h 
E ete) сщ (V-31) 
i j^» 
on = —2},; (Kian T 0-5 Tits) 0 (У-32) 
дг; тһ 
апа 
SR оо nc Generi) с\й (V-33) 
ort; ^ 
where 
R= 2222 Kn —p—t — rj — Thi)? (V-34) 


h make equation (V-34) a minimum. 


A, ti, rj, and гізу are the estimates whic’ f 
leads to the following set of normal 


The above set of differential equations 


equations: 
Equation for A: 
X... = nko + vk ri kE р угу + покр. (V-35) 
Equations for t;: 
Xi.. = nkt; + kr + rl) + nkf. (V-36) 
Equations for r;: 
"n ЕР? s + rl) + vkr; + vb. (V-37) 
Equations for гіх: 
(V-38) 


Xij = k(l; + г; + г + f). 
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In order to obtain unique solutions for the n + v + по + 1 partial re- 
gression coefficients, the following restrictions are imposed: 


DEO; (V-39) 
225 = 0, (V-40) 
and 
Улу = 0 = У, (V-41) 
i j 
With these restrictions the least squares estimates are 
A UE 
Unk = 1, (У-42) 
Ф = = 
t= T R = ži. — &, (V-43) 
Х.;. ho £ 
n=- k a= 85-4, (V-44) 
and 
Жу: = = = = 
ry = > — tk — ry — fh = By — 4. — By 2. (V-45) 


The variances and covariances of the above least squares estimates may be 
obtained as before and are left as an exercise for the reader. 


If it is assumed that the ту, ру, тру, and ¢:;, are random independent variates 
with zero means and variances єс, с, op, and oê, respectively, we may 
proceed to obtain the expectations of the various mean squares obtained in 
section V-1.5. 


The residual (sampling error) sum of squares obtained after fitting the 
constants д, t;, гу, and rl;; has the expectation: 


E33 X3 у= AX... = Б ОХ. — MX. $XrnX] 


"9505 - 5] 


А ОЛКЫ M (ат Кр Ерте жаш! 
T3. k 


E ут mos + Trp + со) E no(kp? + Ка? + ko,? AE ko,p? + с^) 


(V-46) 
In a like manner the expectations of the other sums of Squares are 
p| Xt ae Rts? 


nk = = (v — Y (e? + kop? + прод), 


g| xs Х..2 
ав 1 0 G2 + hou + фе), 


(V-47) 


(V-48) 
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and 


24 "E: 2% 2 
(V-49) 
If it is assumed that the r; pj, and тр;у are fixed effects which sum to 
zero in the experiment, then the sampling error mean square has expectation 
cê; the interaction = “experimental error" mean square has expectation 
se + КУУ (рт) (п — DO — 1); the treatment mean square has expecta- 
tion o2 + nk? т/(ш — 1); and the replicate mean square has expectation 
с + sky p?/(n — 1). For an experiment in which all effects are assumed to 
be fixed, the sampling error is the appropriate error mean square for testing 
the null hypotheses of the effects ру, Ti and рт;;. This was explained in example 
V-2. The nature of the material determines the linear model. The assumptions 
are made by the experimenter. 
The above expectations are difficult to obtain if the pumber of individuals, 
kij, per experimental unit varies. A discussion of the various expectations may 
be found in the literature [79, 111, 155, 169, 175, 290]. 


V-3 Development of Formulae for Missing or 
Deleted Values 
If an experimental unit is missing, the yields for an experiment designed 
Әз a randomized complete block may be represented as 


licate 
Treatment - Вараб Totals 
E a &— э = 
rs EX 
1 Xa Xo As Xi a п 
2 Xn Хи } у 
› З Ху 
Y X Xa Xvo " 
° Meer 
"Totals Xa XX Xo is 


where €. ; » v = number of 
u is the missing value, n ; & 
treatments, Ху = ЖЕШ, and the various totals are listed UN did first 
and the last column. There is no loss in generality in Вјана a EU 
A and column. If the analysis of variance оп the values in the 
computed, the error sum of squares is з ] 
v 2 
(Х, ER + EK | [К+ + ZA 
1- 2 Do cur EE 
Ra Ži 3 SX? 04 Sa aE — " 
+ K+ Ху 


nv 


(V-50) 
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ӘН _ og _ 206- x Хм) _ 20a Xu) 4 20C. + Xu) 


3X. 3 Er 0. (V-51) 
Solution of equation (V-51) for €; results in the following: 
_ Xi. + nX- X.. 1 
Xu = GEG) (V-52) 


which is equation (V-5). 
If more values are to be estimated, say X, and Xa, we follow the same 
procedure and obtain two equations for estimating the missing values, 
ro nX.4 + (v — А. + X. — X.. 
(v — 2)(n — 1) 


(V-53) 
and 


пх. + (v — DX. + X. — X.. є 
Rn = GE DIO = D d (V-54) 


If Xu and Xs are missing, the two equations for estimating the missing 
values are 


Ly = C= D — D(X a +X.) — о. — nX – (n — v — n)X.. 


(nv — 0 — n) (mw —v—n4- 2) (V-55) 
and 
= (n — D)(v — D)(nXa + 0X2.) — 9X. nX4 — (n — v — )X.. 
5 (nv — v = ait = v—n F 2) n - (V-56) 


The procedure may be continued to obtain the equations for various combina- 
tions of missing values. The general formula for missing values becomes too 
complex for easy manipulation [111]. An experimenter may either use the 
iterative method suggested by Yates [316], or he could develop formulae 
for his particular needs by the method outlined above, 


CHAPTER VI 


The Latin Square Design 


VI-1 icati 
Applications of the Latin Square Design 


VI- 
I-L1 INTRODUCTION 


In : 

E a complete block designs the ге 
fiscal nditiher nen must appear together in a block an equal or propor- 
зев) aen ke rather than being allotted at random over the whole 
square design, two m the completely randomized design. For the latin 
area divided ae Y strictions are imposed; namely, that for an experimental 
row and once in a si and columns, each treatment must appear once in а 
into replicates i olumn. Thus for latin squares, the treatments are grouped 
Elihiinaton к. in two ways, once in rows and once in columns. Through the 
the residual or row and column effects from the within treatment variation 
removal of ees variance may be considerably reduced. The effect of the 
trated after th p and column variances on the residual variance is illus- 
Latin Ане ibus os on the construction and design of latin squares. 
Work, They а гече have a wide variety of applications in experimental 
medical, ma 4 used in industrial, laboratory, field, greenhouse, educational, 
group of чаг ейпв, and sociological experimentation; from comparing a 
comparing NN or fertilizer treatments to testing biological assays, from 
and er * differences in the laboratory to comparing weaving proc- 
ature citati sting tea to comparing p n a hospital. Reference to 
е wide vari ons of numerical examples of the latin square design indicates 
used, Man iety of experiments for which the latin square design has been 
origina] Peg uses may be found in other literature citations in which the 

Опе. are not reproduced. 
as but to consult researc! 


lati 
n 
ч Е 
Square design. Despite its popularity 
if two or m 


2x2,8X 
dom associate 


striction is imposed that all 


liter. 


popularity of the 
the latin square design is practical 
ore squares are used, it is 
3, and 4 X 4 latin squares, 
d with the residual sum 
freedom in the error term, it is recom- 


d or that another design be used 
licates as treatments, the 


hers to determine the 


on] 

HN e to twelve treatments; 
ere are E fewer treatments. For the 
Squareg ero, 2, and 6 degrees of free 

mended th and with such few degrees of 
Ince the ee the latin square be repeate 

atin square design requires as many rep 
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design is seldom used for more than ten to twelve treatments. With regard 
to the use of the latin square design and with regard to the high precision 
(standard error less than 2 per cent of the mean) frequently obtained, Fisher 
[126, sec. 33] says, “If experimentation were only concerned with the com- 
parison of four to eight treatments or varieties, it (the latin square design) 
would therefore be not merely the principal but almost the universal design 
employed." 


VI-1.2 ADVANTAGES AND DISADVANTAGES 
The advantages of the latin square design over other designs are 


(i) With a two-way stratification or grouping the latin square controls more of 
the variation than the completely randomized design or the randomized 
complete block design. The two-way elimination of variation often results 
in a small error mean square. 

(ii) The analysis is simple; it is only slightly more complicated than that for 
the randomized complete block design. 


(iii) The analysis remains relatively simple even with missing data [1, 14, 33, 85, 
216, 316]. Analytical procedures are available for omitting one or more treat- 
ments, rows, or columns [321, 333]. 


The disadvantages of the latin square design are 


(i) The number of treatments is limited to the number of rows and columns, 


except as noted above [321, 333]. For more than ten treatments the latin 
square is seldom used. 


(ii) For fewer than five treatments the allocation of degrees of freedom for con- 
trolling heterogeneity is disproportionately large. Even with repetition of 
squares a disproportionate number of degrees of freedom is associated with 
rows and columns for two, three, and four treatments. When corrections are 
made for degrees of freedom (formula (I-1)) the latin square may not be as 


efficient as the randomized complete block or completely randomized designs 
for two, three, and four treatments. 


Cochran [45, 47] reported that the efficiency of the latin square design 
relative to the completely randomized design is 222 per cent, and, relative 
to the randomized complete block design, is 137 per cent for the experiments 
grown at Rothamsted and associated centers during the years 1927 to 1934. 
This means that ten replicates of a completely randomized design or six 
replicates of a randomized complete block design are roughly equivalent to 
four or five replicates of a latin square. Similar results were obtained at the 
TA of Saskatchewan by Ma and Harrington [201], who found that 

e randomized complete block was only 79 per cent as efficient as the latin 


square; i.e., four replicates of a latin 1 i 
i square are approximately equivalent to 
five replicates of a randomized complete block дыл. vu 
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VI-1.3 CONSTRUCTION AND ARRANGEMENTS 


For the discussion on the construction of latin squares, it is advantageous 
to define or explain some of the terminology used in connection with these 
designs, Fisher and Yates [129] give the following definitions: 


(i) standard square. A square is said to be standard if the first row and first column 
are ordered alphabetically or numerically. There are as many standard squares 
for a k X k latin square as there are types which cannot be converted into 
another square by a reshuflling of rows and columns. 

(ii) conjugale square. Two standard squares are conjugate if the rows of one are 
the columns of the other. 

(iii) self-conjugate square. A square is self-conjugate if its arrangement in rows 
and columns is the same. 

(iv) adjugate sel. By permuting with each other the three categories, rows, columns, 
and letters, six sets (not necessarily all different) are formed. The resulting 
sets are said to be adjugate. 

(v) self-adjugate set. A set is self-adjugate if a permutation of the three categories, 
columns, rows, and letters results in the same set. 


For the 2 X 2 latin square, there is only the one standard square, 


tandard square result in the same 


The two conjugate squares for the above s 1 
hat the 2 X 2 latin square 18 also 


arrangement as given above. This means t KE. 
self-conjugate, since the letters in a row are in the same order as those in the 


corresponding column. By interchanging rows with columns, columns with 
letters, and letters with rows, three latin square arrangements are obtained. 
The conjugate of each of the above three sets may be obtained resulting In 
| the six adjugate sets. These sets give the single square for the 2 X 2 latin 
square; hence, the 2 x 2 latin square is self-adjugate. 

Likewise, for the 3 x 3 latin square, there is only one standard square, 
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The square is self-conjugate, since the arrangement of the letters in rows 
and columns is the same. To illustrate the construction of the adjugate set, 
interchange the column order with letter order in the square, 


to obtain the square, 


Upon interchanging rows 


a total of six square: : 
B dan secun id or the adjugate set; these squares are the same as the 


and columns of the above three squares, we obtain 


reordered by interchanging rows and by interchanging 


columns. H. i 
owever, the six squares of the adjugate set are not all the same. 
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There are twelve possible arrangements for the 3 X 3 latin square: 


There are 31 (3 — 1)! = 12 arrangements for the 3 X 3 latin square, of which 
1 are nonstandard squares. 


Four standard squares are possible for the 4 X 4 latin square: 


| 10 
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All four standard 4 X 4 latin squares are self-conjugate. For each standard 
square, there are 4! (4 — 1)! — 144 possible arrangements, resulting in a 
total of 576 possible arrangements; these have been tabulated by Kitagawa 
and Mitome [187]. Of the 576 arrangements, 572 are nonstandard squares, 
and the remaining are the four standard Squares. 

For the 5 X 5 latin square, there are twenty-five standard squares and 
their conjugates, plus six self-conjugate squares, resulting in fifty-six standard 

. Squares. Also, there are 56(5 !)(4!) = 56(2880) = 161,280 possible. arrange- 
ments [129]. 

The number of possible arrangements increases rapidly as the size of the 
latin square increases. It is obvious, then, why the possible arrangements for 
all k X k latin squares have not been tabulated. Fisher and Yates [129] give 
the standard squares for the 4 x 4 and 5 X 5 latin squares; they [128, 129] 
give the five conjugate pairs of transformation sets and the twelve sets con- 
taining conjugates for the 6 X 6 latin squares. Norton [233] has tabulated 
the 562 sets from which it is possible to generate the 16,927,968 standard 
squares for the 7 X 7 latin square.! To date, all the standard squares for 
higher ordered latin squares have not been tabulated. 

Further discussion on the formation of latin squares appears in the litera- 
ture citations [126, 128, 129, 175, 207, 308, 317] at the end of the book. 
A number of literature citations relating to the construction of latin squares 
appears in references [113] and [129]. Some of the topics related to orthogonal 


latin squares [129] and graecó-latin squares [126, 129] are discussed in later 
chapters, 


Fisher and Yates 
10 X 10, 11 X 11, an 
make up hi: 


[129] list sample squares for the 7 x 7,8х 8,9 х9, 
d 12 X 12 latin squares. The experimenter may also 
s own sample squares for the larger squares. 


VI-1.4 RANDOMIZATION 


In designing an experiment as a latin s 


18, i 
rd ih Sur 22:306, 1951) has found the correct number of 7 X 7 latin 


quare, one of the possible arrange- 
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ments is selected at random. The procedure is quite simple for the 2 X 2 
latin square; of the two arrangements, 


one is chosen by the toss of a coin or from a table of random numbers. The 
letters A and B represent two treatments under consideration. Likewise, for 
a 3 X 3 latin square with the treatments A, B, and C, one of the twelve 
arrangements listed above is chosen at random. 

All possible arrangements of the 2 x 2 and 3 X 3 latin squares may be 
found in several references. The remainder have not been enumerated to date. 
Therefore, another method for selecting a random arrangement must be pro- 
vided. In accordance with the rule for obtaining latin square arrangements 
аз set forward by Fisher and Yates [129] the following procedure may be 
utilized for obtaining latin square designs for experiments: 

G) 2 x 2 latin square. Randomize the arrangement of the columns of the stand- 
ard square, or, alternatively, select one of the two arrangements at random. 

Gi) 3 X 3 latin square. Randomize the arrangement of the three columns and of 
the last two rows of the standard square or, alternatively, select one of the 
twelve arrangements at random. 

(i) 4 X 4 latin square. Select one of the four standard squares at random and 
then randomize the arrangement of the columns and the last three rows. 
Also, the procedure of selecting one of the 576 arrangements at random may 
be used [187]. Hu. 

Gv) 5 x 5 latin square. Select one of the fifty-six standard squares at random an ' 
then randomize the arrangement of the five columns and the last four rows; 
this results in the selection of one of the 161,280 arrangements. 

(V) 6 X 6 latin square. Select one of the 9408 standard squares at em is 
randomize the arrangement of the columns and the last five rows; alter- 
natively, select at random one of the sets enumerated by Fisher and ir 
[129] in proportion to the number of standard squares possible in the se 
and then randomize the allotment of the letters $e the treatments, the arrange- 
ment of the columns, and the arrangement of the rows. 

(vi) 7 X7 latin square. Select one of the 16,942,080 standard squares at randon 
and then arrange all columns and the last six rows at random; alternatively, 
follow the second plan for the 6 X 6 latin square. 

(vii) 8 X 8 and higher latin squares. Select one of the tabulated squares or aon ee 
one and then arrange the columns and the rows at random and assign x 
letters to the treatments at random or follow the alternative procedure ou 
lined below. " 

The Procedure given in (vii) may be used to construct the 5 X 5 and ys ЕА 
latin Square arrangements. However, it must be remembered that this met 
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does not result in all possible arrangements, since certain configurations are 
excluded. Little harm from this procedure is likely to result unless the latin 
squares are used extensively or unless the results of several experiments are 
summarized. Yates [317] has discussed the theoretical basis for randomization 
of latin squares. The student is referred to the above reference for a further 
discussion of this topic. 

An alternative procedure for obtaining a random arrangement may be 
used for the larger latin squares. It is observed (i) that the method is biased 
in that all arrangements do not have an equal probability of being drawn and 
(ii) that this method gives all possible arrangements. An outline of the pro- 
cedure follows:! 


(i) Assign letters to first row at random; this results in k! permutations of the 
letters in the first row. 


(ii) Assign the remaining (k — 1) letters at random in the first column resulting 
in (k — 1) ! permutations of these letters. 

(iii) Continue the process until all rows and columns are filled, excluding letters 
which have already appeared in a row or in a column. 


As a further suggestion, one might take the resulting square from the above 
procedure and randomize the rows, columns, and letters. It is not known 
what effect the second randomization has on the bias. The bias may be illus- 
trated fairly simply with the 4 X 4 latin square. If the above procedure is 
followed, it will be observed that the probability of selecting two of the four 
standard squares is 4 each, and the probability of selecting the other two 
standard squares is 2 each. 


VI-1.5 EXPERIMENTAL LAYOUT 


The general concept is that the latin square design should occupy a square 
or nearly square experimental area. In practice, this is generally true for 
field experiments, but it is not necessary; usually the rows are perpendicular 
to the columns; thus: 


Column number 


Row number 


Suggested in a discussion with F. Yates and P. J. McCarthy. 


$ VI-1.5] Experimental Layout 143 


Now, the purpose of the latin square design in field and laboratory experiments 
is to control variation in two directions, such as down the field and across 
the field, across the greenhouse bench and along the bench, or from two 
sources. In some instances, it may be desirable to keep the treatments in a 
row in a compact block in such a way that the blocks are the rows and the 
order within the blocks represents the columns. Such an experimental design 
might be illustrated by the following: 


Row 1 or Row 2 or Row 3 or 
Block 1 Block 2 Block 3 


> 


Row 1 c 
B 
Cc 
Row 2 B 
A 
Row 3 


The above design might be used on a single row or set of rows in a grape 
vineyard, where treatment A represents no spray ог check, treatment B 
represents spray 1, and treatment C represents spray 2. The sprayer could be 
equipped with two tanks and the various treatments applied as indicated in 
the design. 

In some instances, more replication is desired. The procedure here is to 
select, at random, the desired number of arrangements of the latin square. 
Suppose that nine replicates for the three treatments A, B, and C are desired. 
The field design could be of the following form for three rows (or three sets 


of rows) in a grape vineyard: 


С А 
А Шз „л у. 
B [e] 
B pue 
С EYEE m 
A (eS 
A ктү ыгы >. 
B a 
G B 
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or for the three locations, farms, or positions: 


Square I Square II Square III 


Modifications of latin square designs result in other configurations, depending 
upon the nature of the experiment and the experimental material. The first 
of the two designs listed above may be useful in baking or cooking experi- 
ments which are conducted over a period of days. If it is possible to bake 
three cakes per day and if the worker tires as the day progresses, it may be 
desirable to have each kind of cake baked in all three orders of baking. The 
3 X 3 latin square design satisfies these requirements. The experiment may 
be repeated on a second set of three days, using a different randomization. 
VI-1.6 STATISTICAL ANALYSIS FOR ONE OBSERVATION PER EXPERI- 
MENTAL UNIT 


The breakdown of the total degrees of freedom in the analysis of a 
k X k latin square design is 


Source of variation Degrees of freedom Mean square 
Row k-1 R 
Column k-1 С 
"Treatment k-1 T 
Error or residual (k — 1)(k — 2) E 
Total к —1 


Тһе row sum of squares is obtained by summing the squares of the row 
totals, X;.., dividing by Ё, and then subtracting the correction term (equal 
to the grand total, X..., squared and divided by Ё?); 


Х.2 + Х,.2 + P AE Ж Apt MA 
k юю “Sk k? 
= 2..Ху.. EX + +++ + Zr Xr. — EX... = КУ. — Ф), (VI-1) 


where Z;.. = row mean and 2 = experiment mean. In a similar manner the 
treatment and column sums of squares are obtained as 


ЕЭ „Жог 


(VI-2) 


and 


e oy 
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respectively, where X..; represents the treatment total and X.;. represents the 
column total. The total sum of squares with Ё? — 1 degrees of freedom is 
obtained by squaring the k? determinations, X;;,, and subtracting the cor- 
rection term, 
2 

Уха T (VI-4) 

Em 
i = 1,2, ---, k and the subscript j = 1, 2, +++, k to give the k? observations. 
Within each row and column the ireatments are arranged to appear once in 
each row and once in each column. Thus, for h = 1, 2, •• •, k, there are two 
restrictions imposed. The error or residual sum of squares is obtained by sub- 
tracting the row, column, and treatment sums of squares from the total, 


IX S$EXGi SUN DEM А 
Xx,-LXslEXS усе иеЫ 

_ The estimated standard error of a difference between two means is ob- 
tained from the formula, 


йс (VL6) 
k 


In the event that one of the treatments yields more variable results than 
the other k — 1 treatments, Yates [316, 321] gives a method for determining 
the error of the more variable treatment and the error of the remaining k — 1 
treatments which have approximately the same amount of variation. In addi- 
tion, he describes a procedure for calculating missing values and for making 
comparisons among the means. 


Example VI-1. One of the studies of the Regional Cooperative Project S-5 reported 
by the Southern Cooperative Group is concerned with the variation in moisture con- 
tent from plant to plant and from leaf to leaf of turnip greens. Knowledge of the 
magnitude of these sources of variation is of importance in sampling studies for chem- 
ical determinations. In order to study the sources of variation in sampling turnip 
greens, Peterson el al. [247] set up a 5 X 5 latin square with five different plants as 
the rows and five different leaf sizes, ranked from smallest to largest and designated as 
t B, C, D, and E as the columns. In addition, it was desired to measure the relative 
Variation due to time of sampling. The treatments in this experiment are the five times 
9f sampling. The data on moisture content of deribbed turnip leaves are presented in 
table VI-1. The moisture contents given in the table are in percentages and are coded 
Values obtained by subtracting 80 from each of the original moisture contents. The 
Sums of squares are obtained from the data and totals in table VI-L. Application of 
formula (VI-1) results in the following sum of squares for rows or plants: 


pe ena aae » 19.68 = 1320.2443 — 1290.8212 = 29.4231. 


Application of equations (VI-2), (VI-3), and (VI-4) results in the sums of squares for 
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times, leaf sizes, and the total corrected for the mean. The error or residual sum of 
squares, equation (VI-5), is 


1353.5604 — 1320.2443 — 1313.8162 — 1291.3635 + 2(1290.8212) 
= 62.1392 — 29.4231 — 22.9950 — 0.5423 = 9.7788. 


The above sums of squares and the corresponding mean squares are presented at the 
bottom of table VI-1. 


TABLE VI-1. Moisture content of turnip greens (minus 80) 


iii. 8.95 
de T:5h 
iv. 9.99 


7.85 


[- [e| 


36.59 36.22 36.03 34.50 36.30 
87.32 87.24 87.21 86.90 87.26 


Uncoded mean 


Plants (row) 

leaf sizes (column) 
Times (treatment) 
Error 


Total 


Correction for mean 


Total uncorrected 


An F test indicates that differences exist among plants and among leaf sizes, but 
that time differences are negligible, thus: 
_ 7.3558 
= -glag = 9-03 > Fa(4 and 12df) = 5.41, 
p _ 5.7488 


= 58149 = 7-05  Fa(4 and 12df) = 5.41, 
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and 


0.1356 
= .8149 


where the F« values are obtained from table 11-8. It may be of interest to determine 
if the time means are more alike than expected relative to the error variance; thus 
F — .8149/.1356 — 6.0, which is greater than the tabulated value for Fos (12 and 4 df) 
= 5.91; this is the 10 instead of the 5 per cent level, since the F test was made after 
observing which mean square was the smaller. Hence, the mean square for time of 
sampling is small but probably not unusually small. If it were significantly smaller 
than the error mean square, then competition, in the general sense, would be suspected, 
and we should determine the element in our experimental technique which gives either 
an error mean square that is too large or a times mean square that is too small. 

As a further subdivision of the sum of squares, it may be of interest to obtain the 
linear, quadratic, cubic, and residual components for leaf size. Without additional 
knowledge on leaf size, we could assign the values —2, —1, 0, 1, and 2, or 1, 2, 3, 4, and 
5, and obtain the components by the method described by Snedecor [273, Ch. 14 and 
15]. In this particular experiment, such comparisons are unrealistic, since the actual 
weights are available and the authors [247] used a covariance analysis on moisture 
contents for leaf size. 

The standard error of a time, plant, or leaf-size mean is 


8149 
c КЕЕ = 404. 


The standard error of а mean difference is 


Ec BEND = 571. 


= 0.17, 


The coefficient of variation is 


Since the heterogeneity was controlled by the two groupings, plants and leaf sizes, 
the experimental design was satisfactory. If either plant variation or leaf-size variation 
is ignored, the residual variance increases considerably. This means that the latin 
Square design is more efficient that either a randomized complete block or a completely 
randomized design for comparing treatment (time) means. If the plants were the repli- 
Cates, the efficiency of this latin square relative to a randomized complete block design 
is 


k- 1)(Е — 2) + 1) ((k — 1) + 3)C + (& — DE (VI-7) 
(& —D(k—2) + ae —1)?+ ) kE 


Where the symbols are defined in the analysis of variance table at the beginning of 
this Section; for this example, the efficiency is equal to 


(12 + 1) (16 + 3) (Eum 4E еш) йлы 
(12 + 3) (16 + 1) 5(.8149) 
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Approximately eleven replicates of а randomized complete block design with panty 
as replicates would have yielded a standard error of a mean equal to that мыр. 
with five in the latin square. It is probably unrealistic to consider leaf sizes as rep i- 
cates and to ignore plant differences, but if the columns were the replicates, the 
efficiency of the latin square relative to the randomized complete block is obtained 


from the formula, 
[(k — 1)(k — 2) + 1J[(k — 1)? + 3J[R + (k — DE] (VI-8) 
[(k — 1)(k — 2) + 3J[(k — 1)? + ПКЕ] : 
which for this latin square is 


13 19 tasse + 4(.8149) н 
(3) 17 5(.8149) 252 Der сеп: 


Approximately thirteen replicates of a randomized block design of this type would 
have been equal to five replicates of the latin square. 


Similarly, the efficiency of the two-way grouping in the latin square relative to 
no stratification, as in the completely randomized design, is 


(4 + 1) (dh + Me = 1А + С) + 16-1 (k — Dk — 2))E 
аһ + 3/\df2+1 (k — DE 


(6 — D)(k — 2) -- AM k(k — 1) +3 R FC t (k — QE. С 
Y (& -)6-2-43 DES zl je +E hs) 


The efficiency of the latin square relative to the completely randomized design is 


13 237.3558 + 5.7488 + 4(.8149) £ 
SE 6.8149) 318 per cent, 


or, approximately sixteen replicates of a completely randomized design would have 
been equivalent to the latin square with five. 


VI-1.7 STATISTICAL ANALYSIS FOR A GROUP OF LATIN SQUARES WITH 
А SINGLE DETERMINATION PER PLOT 

In some cases, it is desirable to have more than a single latin square at a 
single location or to have a single latin square at several locations. For the 
2 X 2, 3 X 3, and sometimes the 4 X 4 latin squares, it is often desirable to 
have two or more squares at a location in order to have sufficient degrees of 
freedom in the error sum of squares. The procedure of designing an experi- 
ment in more than one latin square has already been discussed. The break- 
down of the total degrees of freedom in the analysis of variance follows: 


Source of variation 


Degrees of freedom Mean square 

Squares (or locations) s—1 S 
Rows within squares s(k — 1) R 
Columns within Squares s(k — 1) Cc 
"Treatments k—1 T 
Treatments X squares (s — 1)(k — 1) TS 
Residual within squares s(k — 1)(k — 2) E 
Total 


sk? — 1 


= ee ee 
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The “treatment X squares” sum of squares may be pooled with "residual 
within-squares’’ sum of squares if there is no treatment-square interaction. If 
the squares are at different locations, then, for some hypotheses, it is appro- 
priate to use the “treatment X square mean square" to test the treatment 
differences. Fisher [126, sec. 65] discusses the analysis for a group of latin 
squares and the appropriate error mean square for testing the differences 
among treatment means for various hypotheses. The analysis for two or more 
latin squares has been discussed by several authors. 

The formulae for obtaining the sums of squares in the above analysis of 
variance table are given below. 

For Я 
squares, У a S. ? жы (120) 
A sk 


for rows within squares, 


Sof SoMa! x m (VI-11) 


tel 


for columns within squares, 


ЕР СОЗА ЕНЕ "5 


for treatments, 


(VI-13) 


5; y X ү е | sue Xess (VI-14) 


for error pens within squares, Я 
; Xa ХХ? уай E ie 
ре Рз т = куга Di xtti nas) 
i=l \7.0,!һ 7 л 2 


апа for the total sum of squares, 
2 


Хы 
E ea 
The subscripts j, h, and g correspond to subscripts h, i, and j for the single 
latin square, as described previously, and i = 1, 2, +++, s. The treatments 
Х squares sum of squares in formula (VI-14) may also be obtained by subtract- 
ing the treatment sum of squares from the treatment-within-squares sum of 
Squares. An application of the above formulae is illustrated below. 
The efficiency of the latin square design relative to what it would have 
been had a completely randomized design been used is! 


2 —1(R+ 0) + s(k — 1E ^ 
Ce рк ыш (VI-17) 


"The correction for the difference in degrees of freedom associated with the two mean 
Squares is not included. 
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where E’ is the mean square from the pooled sums of squares for error within 
squares and treatments X squares. The other symbols are defined above. 
The efficiency of the latin square design relative to what it would have been 
had the rows been used as replicates within each square is (for E’, as defined 
in formula (1-17): 
s(k — 1)C + s(k — 1)?E’ 
SKE-IE^ КЕ (VI-18) 


If C is replaced by R in formula (VI-18), the formula for efficiency is obtained 
for using the columns as replicates. Also, the efficiency of the latin square 
relative to what it would have been had a completely randomized design been 
used in each square is! 


s(k — 1)(R + C) + s(k — 1 я 
oT =a ee WH 


Ezample VI-2. Dominick [92] has used sets of latin squares in marketing research. 
The data, pounds of apples sold per hundred customers, presented in table VI-2, 
represent a part of one such set. Data for the remainder of the set are given in problem 
VI-3. Four treatments on McIntosh apples were compared in four stores; treatment 
A = regular apples, B = apples of 2.25-inch diameter at a lower price, C = carefully 
handled uniform apples of 2.5-inch diameter, and D = highly colored uniform apples 
of 2.5-inch diameter. It was suspected that the days of the week as well as the parts of 
the week might contribute to the variability. The first part of the week is Monday, 
Tuesday, Wednesday, and Thursday, and the second part of the week is composed of 
Friday morning, Friday afternoon, Saturday morning, and Saturday afternoon. 
From previous work, it was found that the two parts of the week contained about 
the same number of customers. 

The randomization scheme followed was a random allocation of the four standard 
squares to the two parts of each of the two weeks. Then, the randomization scheme 
of section VI-1.4 was followed for each square. 

The analysis of variance is obtained for each square separately, and the results 
are combined by the method of the preceding section. The separate analyses present 
no additional work, unless the individual mean squares are obtained, and may indicate 
the source of large variations. As a general rule, it is wise to study the individual 
analyses in connection with the combined analysis. The separate analyses and the 
combined analysis of variance are presented at the bottom of table VI-2. 

The sum of squares for weeks (squares), formula (VI-10), is obtained by adding 
the correction terms from the individual weeks and subtracting the new correction 
term, thus: 10050.06 + 11130.25 — 8232/32 = 13.78 = (401 — 422)?/32, with 1+ 1 
— 1 = 1 degree of freedom. 

The stores-within-weeks, days-within-weeks, and treatments-within-weeks sums 
of squares are obtained by adding the sums of squares from each latin square, 

_ thus: stores-within-weeks = 848.19 + 408.75 = 1256.94, days-within-weeks = 237.69 
+ 1080.75 = 1318.44, and treatments-within-weeks = 707.19 + 1146.75 = 1853.94, 


Тһе correction for the diff i 4 
pauses Gok hice. erence in degrees of freedom associated with the two mean 


= 


TABLE VI-2. Pounds of McIntosh apples per 100 customers purchased in four stores for 
four treatments in the first parts of two weeks 


Week 1 
Store number 


Store number 
3 lh | Total 


Day of week 


z5 RS 
B20 A22 D4 C25 
Dek cl BR А27 
Re pc B 22 


Day totals Wed. Thurs. 
205 155 265 
ss 


Week 1 5. = 
Te = 
Totals 167 = 
o 29:8 E о EZ 2 


Stores (column) 
Days (rov) 
Treatment 

Error 


Total 


Correction for mean 


Total uncorrected 


Source of variation 


Squares (weeks) 

Days within weeks 

Stores within weeks 1256.94 
Treatments within weeks 1853.94 


Treatment 1540.59 
Treatment x week 313.35 


Error within weeks 1065.37 
Total 


Correction for mean 


151 
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each with 3 + 3 = 6 degrees of freedom. The latter sum of squares is partitioned 
into the two components, treatment sum of squares, formula (VI-13), 


167? + 158? + 200° + 298? 823° 
2(4) 2(4)(4) 


= 1540.59, 


with 3 degrees of freedom, and the treatment X week (square) sum of squares, 
1853.94 — 1540.59 = 313.35, with 6 — 3 = 3 degrees of freedom. The other two 
sums of squares may be partitioned in the same manner as treatments-within-weeks 
for this particular example. If the two latin squares had been conducted during the 
two parts of one week, it would not be correct to partition the days-within-weeks sum 
of squares into the components days and days X weeks. Also, the above statement 
holds if eight stores instead of four had been used for the two 4 X 4 latin squares. 

The error-within-squares sum of squares is obtained by summing the individual 
error sums of squares, 451.87 + 613.50 = 1065.37, with 6 + 6 = 12 degrees of freedom. 
The total sum of squares for the two squares may be obtained by formula (VI-16) 
or by adding the uncorrected total sums of squares from the individual squares and 
subtracting the overall correction term, 12295 + 14380 — 8237/32 = 5508.47, with 
16 + 16 — 1 = 31 degrees of freedom. 

The treatment mean square is significantly larger than ordinarily expected in 
sampling from a homogeneous population; F — 513.53/88.78 — 5.78 is larger than 
Fos;(3, 12df) = 4.47 and almost equal to Fo(3, 12df) = 5.95 (see table II-8). There- 
fore, we reject the null hypothesis of no difference among the four treatments on the 
purchase of apples. If it is desired to compare the other mean squares with the error 
mean square, the F test may be used. However, for this particular experimental setup 
the stores-within-weeks and days-within-weeks sums of squares should be partitioned 
into the component parts prior to making any tests. 


The standard error of a treatment mean is sz = 4/E/sk = 4/88.78/8 = 3.33. 
The standard error of a difference of two treatment means is sj — NEG sts x) 
= 4/88.78/4 = 4.71. The coefficient of variation is 4/E/z = 324/88.78/823 = 37 


per cent. Application of Duncan's multiple comparisons test indicates that treatment 


D is different from treatments A, B, and C and that A, B, and C do not differ among 
themselves. ; 


The efficiency of the two latin squares used relative to what would have been 
obtained from a completely randomized design is equal to (formula (VI-17)), 


13.78 + 6(219.74 + 209.49) + 18(91.91) 
31091.91) X 100 = 149 per cent, 


where 91.91 = (313.35 + 1065.37) /15. The use of E" = 91.91 assumes а zero treat- 
ment X week interaction. If this is not true, then formulae (VI-17) to (VI-19) should 
be modified accordingly. Approximately twelve replicates of a completely randomized 
design would have been required to attain the same precision as the present design 
with eight replicates. Likewise, if the stores and weeks were the replicates of a гап- 
domized complete block design, the efficiency of the latin square is (formula (VI-18)); 


6(219.74) + 18(91.91) 
Sols 1 X 100 = 135 per cent. 


$ VI-1.8] More Than One Observation Per Experimental Unit 153 


ҮІ-1.8 ANALYSIS FOR MORE THAN ONE OBSERVATION PER EXPERI- 
MENTAL UNIT 

Several variations of latin squares are possible, and some of these will be 
discussed in later chapters. The example of the present section is a latin 
Square with more than one unit per plot. The particular breakdown of the 
degrees of freedom in the analysis of variance depends upon the nature of the 
material and the design of the experiment. 


Example VI-3. A 4 X 4 latin square design [288] was set up to compare the 
effects of four light intensities (D = dark or zero, L = 500, M = 900, and Н = 1200 
foot-candles of light) on the difference in bioelectric potential (in millivolts) between 
à point on the stem of the bean plant and the point at which the nutrient solution made 
contact with the stem. Since the difference in bioelectric potential required a period 
of 1 to 114 hours to become stabilized after a change in light intensities, it was neces- 
Sary to wait two hours after changing light intensities in order to obtain a reliable 
measure of the difference in potential between the two points measured on the stem 
of the bean plant. The first treatment was applied at 10:00 л.м., and the reading was 
recorded at noon. The second treatment was started at noon, and the corresponding 
reading was taken at 2:00 р.м. Likewise, the third and fourth treatment readings were 
Tecorded at 4:00 and 6:00 р.м., respectively. It was believed that time of day might 
have an effect on differences in bioelectric potential. Therefore, it was necessary to 
have the treatments (light intensities) applied once at each of the four times. A period 
of four days was required to run the experiment. The time of day was considered to 
be the row effect, and the day of the week the column effect. The light intensities were 
the treatments, 

С The original data with three plants per day exposed to each of four light inten- 
Sities and two readings on each plant under each of the light intensities are recorded 
in table VI-3. A different set of three plants was used on each of the four days. The 
order of applying light-intensity treatments was at random with the restriction that 
each of the treatments must occur in each of the orders over a four day period. The 
arrangement of the treatments (light intensities) and the various totals used in ob- 
taining the analysis of variance in table VI-4 are given in table VI-3. 

The sums of squares for the analysis of variance in table VI-4 are obtained in 
much the same manner as for previous examples. The total sum of squares is obtained 
Y Squaring the 96 determinations and subtracting the correction term, 

4440? 


64? + 65? + +++ + 53? + 30? — 10635 15,654.00. 


The row, column, and treatment sums of squares are, respectively, 
1112? + 1296? + 1068? + 964? _ за = 2403.33, 


3х2 х4 = 24 
2 2 
1226? + Meo 1135? + 1020? — d = 1032.58, 


and 


2 2 
эмеш. ш-н 4и уу 


for bean plants under four light intensity treat- 


TABLE VI-3. Differences in bioelectric potential (millivolts) between stem and nutrient solution s 
ments (D = 0, L = 500, M = 900, and Н = 1200 foot-candles of light) arranged in a 4 X 4 latin square 


1st det. 
Noon 2nd det. 
Total 


Cell total 


2:00 PM 


Cell total 


4:00 FM 


Cell total 


Treatment L 
26 54 5 
6:00 РМ à AS 53 
53 106 107 
Се11 total 266 


Column total 
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TABLE VI-4. Analysis of variance of the data in table VI-3 


Source of 
variation 


Degrees of Mean 


freedom 


Time of day (row) 
Day of week (column) 344.19 


Light intensities 


(treatment) 125.19 
Experimental error 556.20 
Among plants within 

cells 265.67 

Among plants in 

columns 379.27 

Remainder 225.13 


Between readings on 
the same plant 


The experimental error sum of squares is obtained by subtracting the row, column, 
and treatment sums of squares from the sum of squares of the k? = 16 cell totals 
squared; i.e., 


320? Rd UM 2 
0? + 308? + : + 294° + 2612 — ии — (2403.33 + 1032.58 + 375.58) 


= 212,498.67 — 205,350.00 — 3811.49 
= 7148.67 — 3811.49 = 3337.18, 
with 6 degrees of freedom. 


The sum of squares associated with the variation among plant totals in each of 
the 16 cells is 


124? + 129? + 67? _ 320? 
2 6 
= 220,936.00 — 212,498.67 = 8437.33, 


942 + 1072 + 60? 261? 
Tee MUI 


with 32 degrees of freedom. The sum of squares attributable to the variation among 
plants within days is 


329° + 485? + 412? 1226? 297° + 398° + 325? _ 1020? 
8 sa dep eR 8 24 


= 209,416.75 — 206,382.58 = 3034.17, 


with 8 degrees of freedom. Subtracting the above sum of squares from that for variation 
among Plant totals in each of the sixteen cells results in the remainder sum of squares 
which is a composite sum of squares of plants X treatments (ignoring rows) within 


columns, thus: 


8437.33 — 3034.17 — 5403.16, 
with 24 degrees of freedom. 


11 
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The sum of squares of the differences among readings on the same plant is ob- 
tained as follows: 
2 
64? + 60? — m 
= 2 — 80 2 
= 221,004.00 — 220,936.00 = 68.00 = (i — ву Ла So 37. 
with 48 degrees of freedom. 


In an experiment designed in this manner, it is possible to test several hypotheses. 
The experimental error is used to test the variation among treatment means. In this 
particular case the treatment mean square is less than the experimental error mean 
square which indicates somewhat more uniformity among the treatment means than 
might be expected in a population with error variances of the magnitude found in this 
experiment. 

The F ratio of the experimental-error mean Square and the mean square associated 
with the differences among plants within the cells of the 4 X 4 latin square is F 
= 556.20/263.67 = 2.11, which is slightly lower than the tabulated F value, 2.40, 
at the 5 per cent level of probability for 6 and 32 degrees of freedom. 

One could test the hypothesis of no differences 
days by the F test, 


2 2 
+ 65° 64 — IF +... + 308 + ap — 90 


among the plant means within 


The corresponding F value at the 5 per cent point for 8 and 24 degrees of freedom is 
equal to 2.36. The variation among plant means for each day could be tested in a 


like manner to determine if any group of three plants may be considered as unusually 
variable. 


The variance attributable to the differences 
1.42, is extremely small in comparison with the 
sion is that one reading per plant would be s 
groups of plants are required. If this is impossi 
replicates of the treatments are required to ob 
relatively small. 

The treatment mean square is smaller, but not, signifi 
others (table VI-4). Neither of the mean sq ium Mad аг ШО 
exhibit any unusual variability. If the treatment 


between readings on the same plant, 
remaining mean Squares. The conclu- 
ufficient and that more homogeneous 
ble, then more plants per cell and more 
tain standard errors of a mean that are 


= 4/556.20/2 х3 X 4 — 4.81, and 

the standard error of a difference of two treatment means is s; — /2(556.20)/2 X 5 X 4 
] pe relative to а randomi lete 

block design or to a completely randomized desi ы ош 


556.20/6 
6. 


w == ——1695 ^ 21 per cent, 


which appears to be rather high for experimental work. There might be differences 
among the treatment, row, or column means, but the experimental material or methods 
were too variable to allow differences of this size to be detected. Following the results 
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from the initial analysis of variance, Taylor [288] studied the experimental material 
and the procedure. He found that plants with high potential readings tended to remain 
high and vice versa. With this information, the plants were divided into homogeneous 
groups with regard to magnitude of initial bioelectric potential readings, and the 
differences among the groups were confounded with day to day differences by applying 
the treatments to a different group each day. By refining his techniques further, it 
was possible to reduce the magnitude of the experimental error mean square and to 
detect differences among the treatments. 

Another arrangement of the above experiment would be to use three different plants 
in each of sixteen cells of the 4 X 4 latin square, resulting in a total of forty-eight plants 
rather than the twelve used. This procedure was considered impractical owing to the 
amount of time required for setting up the apparatus to obtain readings on differences 
in bioelectric potentials between the stem of a bean plant and the nutrient solution in 
which its roots were submerged, but if it had been used, the breakdown of the total 
degrees of freedom would be t 


———— U eee 


Source of variation Degrees of freedom 

Row (time of day) 3 

Column (day of week) 3 

Treatment (light intensities) 3 

Error (experimental) 6 

Among plants within cells 2 X16 = 32 

Between readings on same plants 1 X 48 = 48 
——_____“etween readings on same plants — — — 13X48—48 — 

Total 95 
E ам а ка, ен д o 0 000 


VI-1.9 MISSING DATA 


VI-1.9.1 Missing experimental units. Allan and Wishart [1] and 
Yates [316] present the following formula for estimating a missing yield for 
the ith row, jth column, and hth treatment in a k X & latin square: 


EO XX AX c 
Aan =e ч 


Where the totals are as defined previously. Yates [316] has also given an itera- 
tive method for estimating the yields for several missing values in a k X k 
atin square. For each missing datum computed, 1 degree of freedom is sub- 
tracted from the error degrees of freedom. 

Bartlett [14] suggests the procedure of inserting a one for the missing 
value and zeros otherwise and performing a covariance analysis with the zeros 
ae the one as the independent variate (see Chapter ХУТ). If more than one 
experimenta] unit is missing, the same procedure is followed except that a 
oe covariance is performed. Nair [216] used Bartlett’s [14] method for 

* Yzing the results from a k X k latin square design with several missing 
See А Paper by DeLury [85] in 1946 summarizes most of the results for 

mg missing experimental units in latin squares or sets of latin squares. 
© row, column, and treatment mean squares have expectations which are 
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slightly too large. The correct mean squares may be obtained with little addi- 
tional work [316]. 


VI-1.9.2 Disproportionate numbers in the experimental unit. If 
disproportionate numbers of observations per experimental unit are available, 
one of the approximate methods of analysis described in V-1.6.2 may suffice. 
If a more precise analysis is desired, an extension of the methods described 
in Chapter 11 of Snedecor's Statistical Methods [273] may be used. Tt will be 


necessary to estimate treatment regression coefficients as well as those for 
rows and columns. 


VI-1.9.3 Other situations. In some cases the latin square may be 
designed with k — 1 treatments in the Ё rows and columns or one of the treat- 
ments may have failed in the experiment. Yates [321] presents the analysis 
for this situation. Also, if one row or column has been lost or is omitted in the 
latin square, Yates [321] gives the method of analysis and illustrates it with 
а numerical example. Yates and Hale [333] give a method of analysis for two 
or more missing rows, columns, or treatments in a latin square. Their results 
are illustrated with an example. It should be pointed out that one or more 
rows, columns, or treatments may be missing either by design or by accident. 
DeLury [85] generalizes their results to several latin squares, and Smith [269] 
presents the analysis for missing observations in an incomplete latin square. 

If two experimental unit yields are obtained as a single total and not 
individually, the method described by Bose and Mahalanobis [33] and by 
Nair [216] may be used to estimate the missing yields. 


VI-2 Least Squares Estimates and 
Expectation of Mean Squares 


In the following discussion, it is assumed that treatments, columns, rows, 
and squares represent random samples from their respective populations. If 


any or all of these items are considered to be from a finite population, adjust- 
ae in the expectations may be made in the manner described in Chapters 
and V. 


VI-21 ONE UNIT PER EXPERIMENTAL UNIT 


If a single observation i 


| е s made on each plot of a latin square design, the 
yield of the ijhth observati 


on may be expressed as 


Xin = nd pit ть 4 €ijny (VI-21) 
| ТЕЕ Оо. *, k appears once in a row and once in 
s # represents the population mean, p; an effect common to the ith 
ne з an effect common to the Jth column, z, an effect common to the hth 

reatment, and e;;, an effect common to the ijhth observation; it is assumed 


where i, j = 1, 2, .. 
a column 
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that the yield of any observation is expressible as the sum of several inde- 
pendent linear effects. 


VELLI Least squares estimates. The least squares estimates of the 
3k + 1 parameters, u, ру, +++, рь, An ++ *, Ay т, ** 5,74, are obtained by partial 
differentiation of the residual sum of squares with respect to the 3k + 1 
parameters, by setting the resulting equations equal to zero, and by solving 
for the set of estimates. The residual sum of squares after fitting the 3k + 1 
constants is 


2 Xin — A — rn — ej — by, (VI-22) 

and the normal И after differentiation аге 
X... = kori + kD + ЁЗ А + Ef, (VI-23) 
Xi. = kri + Dey + УМ, + bh, (VI-24) 
T X4. = Yrs + ke; + УМ, + kp, (VI-25) 
Х.. = Dori + Ус; + Ы, + kf. (VI-26) 


The r;, с» tx, and f are the estimates which make the residual sum of 
Squares a minimum, Now in order to obtain a unique solution the following 
restrictions are necessary: 

ЫРУ: 0. EN 
T j ^ 


With the above conditions, then, 


а = 2 = X.../k, (VI-28) 
та Жүр д = 5. 8, (VI-29) 
i ej = X4Jk hm 2. — 8, (VI-30) 
hm XE ado d (VI-31) 


The variances of the least squares estimates may be obtained as before 
(Chapters IV and V). 


V1-2.1.2 Expectation of mean squares (Model II). The sum of 
Squares due to Ё is the correction term and has the expectation: 


Dut ps + Xy H tat ein) | 

2 = 

Ерх..] = PES | - li Р 

= kp? 4 ko è + kox? + ko? + o2. (VI-32) 
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The total sum of squares (uncorrected for the mean) has the expectation: 
EL 22x] = 2 El + pi + NE Ta H er 
= R + op о +07 + 00). (VI-33) 
The residual sum of squares after fitting ri, су, tn and р has the expectation: 
AA ах rA Ха. — УХ. 
= В оо +o +02) — (Kp + ko + kok + ko? + koe’) 
— (E? + hod + kox + ko + ko) — (kp? + ho? + kox + ka? + koe) 
+ 2(® н? + ko 2 + kox + ko? + сё) = (k — 1)(k — 2)сё, (VI-34) 


which is the residual error variance times the degrees of freedom. 

The sum of squares due to the r; only is the sum of squares due to A, Tis 
cj, and t» minus the sum of squares due to р’, cj, and t,’ , assuming г; equal to 
zero; for the orthogonal case, the expectation is 


EDXnX.l- p| zx š 21 = (k — 1)(е2 + В), (VI-35) 


with k — 1 degrees of freedom. 


In a like manner the sum of squares due to the c; and the t, have the re- 
spective expectations: 


(k — 1) (e + ko?) (VI-36) 


and 
(k — 1)(o2 + ko?). (VI-37) 


The total sum of squares corrected for the mean has the expectation: 


се 
pi k 


= (kè — od + (k — D) ke? + (k — Y) Rog + (k — 1) Ком, (VI-38) 


which is the sum of the expectations for the treatment, row, column, and 
residual sum of squares. ) 


У1-2.2 з SQUARES ОЕ k X k LATIN SQUARES 


The linear model for s groups or squares of k X k latin squares is 


Хум = p + òi + ту d бту + pin + Nio + ehe (VI-39) 


where p represents the population parameter for the mean, ô; = an effect 
common to ith square, т; = effect common to jth treatment, ёт; = an effect 
common to the jth treatment in the ith square, p:n = effect common to hth 
row in the ith square, А = effect common to the gth column in the ith 
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square, iji; = effect common to ijhgth observation, i = 1, 2, *-:, s;J = 1, 
2, +-+, k and appears once in а row and once in a column in each square; 
h = 1, 2, +++, k; and g = 1, 2, +++, k. 


VI-2.2.1 Least squares estimates. The least squares estimates are 
obtained from the following sets of normal equations: 


X... = BD: + УУ, + kZ Z dla + Ер L + с + skh. (VI-40) 


Xi... = Kd; + КУУ, + ЕЗ йу + kE rin + ЮУ + E. (VI-4)) 
X... = KY d; + sh; + ЕЗ йу; + CDi + 22226 + 8. (VI-42) 
Xij.. = kd; + kl; + kdl; + Dorin + 2265 + kh. (VI-43) 
Хы. = kd; + Yol + Уйу + krin + 226i + kh. (VI-44) 
Хе = kd; + Dy + Уй + Prin + Res + kh. (VI-45) 


After imposing the restrictions, 
Ea: = Уу, = Уу = Dew = Zdi; = Deals = 0, (T9) 
h g i ы i 


the estimates are found to be 


R = = X... /sk, (VI-47) 

d; = (X,.../k) — 8 = i... — Ž, (VI-48) 

lj = (X.;../sk) — 2 = Ē.j.. — 8, (VI-49) 

Pin = (Xi.n./k) — Zi... (VI-50) 

Cig = (Xi..,/k) — 8i... (VI-51) 
and 

dli; = (X../k) — (Х‹:.../Ё) — (X.4../sb) + 2. (Ү1-52) 


VI-2.2.2 Expectation of mean squares (Model II). The total sum 
of squares corrected for the mean, or the sum of squares after fitting д, has 
the expectation: 

E| Exin = z] = У) Elu + ôi + rs + ӧтә + pin + № + ео]? 

1 ijho 
= Elska + (ду +. vee + 8) + skla t +++ Hr) + körat +++ H n) 
+ Ron + +++ + pa) Фа t eee X) + 2727206] 
= SEP + o + с + os? F op H ot toe) — GEI! + Eoi + skot + kon? 
+ ko? + kox? + o2) = (sk? — Woe + (sk — 1)ko è + (sk — Doy 
+ (sk — kos? + sk(k — 1), + (s — lo? (VI-53) 
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The sum of squares for treatments has the expectation: 
E[$26X.,.] = p| УХ — zs] = (k — 1)(с2 + kos? + ska), (VI-54) 
with (k — 1) degrees of freedom. 
The squares sum of squares has the expectation: 


ах] ра = Ee 
= (s — D) (e? + kay! + ko? + kos? + hos), (VI-55) 
with (s — 1) degrees of freedom. 


The treatment X square sum of squares has the expectation: 
хаах? Xj? Es 
sz Bopp = У B 
= (s — D(k — 1) (c? + hos). (VI-56) 
The row within squares, column within squares, and residual within- 
Squares sums of squares have the respective expectations: 


s(k —1)(o2 + ke 2), s(k — 1)(e2 + kox), and s(k — 1)(k — 2). 


УІ-2.3 k X k LATIN SQUARE WITH p ITEMS PER CELL AND d DETER- 
MINATIONS ON EACH ITEM 


It is assumed that the treatments, rows, columns, items, and determina- 
tions are random samples from their respective populations and that the yield 
of the ijhgfth observation may be expressed as the sum of the several inde- 
pendent effects; that is, 

Хуму = b+ pi +N H tat eg + Tijng + Sijnosy (VI-57) 
where и = population mean, р; = effect common to ith row, A; = effect com- 
mon to jth column, 7, = effect common to the hth treatment, єуһ = effect 
common to ijhth cell, Tijn = effect common to the gth item in the ijhth cell, 
буу = effect common to the ijhgfth determination, i, j = 1, 2, ++», k; 
h = 1, 2, «++, k appears once in each row and once in each column; g = 1, 2, 
* p; and f = 1,2, ---, а. For this example a new sample of items is used in 
each cell of the k X k latin square and determination f for one item has nothing 
in common with the fth determination for another item. 

The léast squares estimates may be obtained as before and are left as an 


exercise for the student. Also, the following expectations may be verified by 
the student: 


Source of variation 


Degrees of freedom Average value of mean square 
BUT Ica os + doz? + dpe? + Карор? 
ү шын ] k—1 os? + doz? + dpe? + Карол 
КОО = oi + doy? + dps? + Карот? 

d Же 2 2 2 

Items within cells í ET 1) ) E i ER тше 
Determinations on same item pk*(d — 1) cit r 
Total 
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In the event that a sample of items is used for each column (see example 
VI-3), the linear model is 


Xin = BH pi + М H Ta F eijn + Tjo + sino F булау» (VI-58) 


where the effects and subscripts are the same as before except for ту and 

«ум. ту, is the effect common to the gth item in the jth column, озум is the 

effect common to the ijhgth observation, and g = 1, 2, + ++, p in each column. 
For this case the expectations for the various mean squares are 


Source of variation Degrees of freedom Average value of mean square 
Row k-1 os? + doa? + рое + аркар 
Column k—1 cj? + dog? + dko,? + dpe? + dpkoy* 
Treatment Ic сз? + doa? + рсе + dpko,? 
Residual (k — 1)(k — 2) os? + doa? + dpo? 
Items within columns kíp- 1 os? + dea? + dkoz? 
Remainder kíp — (к — 1) ©з? + dea? 
Determinations on same item pk*(d — 1 os 
Total dpk? — 1 =a 


See EE SSS 


The correction term has the expectation: 
„| Х...2 
| et додав + dok + Eu Ee + PELE 
+ УУ + dYXYEXeec ELELE bun] 
= dpi? + dpk(o,? + ox? + 92) + dpod + с? + doa? + ef = CT. (VI-59) 
— 1) degrees of freedom 


П The items-within-columns sum of squares with k(p 
tas the expectation: 


Е| EEX: _ хы 
dk > арк 


= Фра + арьс 2 + ароз + dpko + dpke? + ёре? + dpka + Крой 
7 (руз + арро + dpk(o,? + o2 + 22) + dios? + dko + koi] 
= Мр — Doi + doa? + Фо 2). 


á The remaining expectations are obtained 
te for the student. For further discussion 
308, sec, 9.4], Mann [207], and Kempthorne [175, Ch. 10]. 


(VI-60) 


similarly and are left as an exer- 
the reader is referred to Wilks 


VIS Development of Formulae for 
Missing Experimental Units 


dor estimating the value of a single experimental unit, there is no loss in 
: he first row, in the first 


genera; К А > 373 Е 
Б°пега у if we assume that the item is missing in t 
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column, and for treatment one. This is true since any arrangement of a latin 


square can be rearranged in this way. The residual sum of squares, including 
the missing value, Xin, is 


k 
Xu УХ? - 5 + Žu)? + 2X: + (Xa + Xy 


4K 2+ Qa d ay + Xx. 3 ыр. (У1-61) 


Тһе various totals are composed of the yield values in the latin square, 


assuming Xm equals zero. The partial derivative of the above sum of squares 
with respect to Xin is 


2Xu- s. +. TXa. + Хм TX.a tX) gr сыш (VI-62) 


We set the above equation equal to zero and solve for Ж. 
equation (VI-20). 


Suppose now that two values are missing and that the values to be esti- 


mated are Xi, and Xm; i.e., two of the yields for treatment one need to be 
estimated. The error sum of squares is 


eet at yy, a Hass КОШ о) 
XE + Xr E (Хал. + Xu)! + (Xa. + Аш)? 

TAG e EXP + (Xa Xu + Xen)? EX E vee t kal 
ла ZG. + Xin Жш), 


ш. The result is 


(VI-63) 


where the totals are composed of the available yield values. 


The partial derivatives of equation (VI-63) with respect to Xi, and Xo 
are 


2 
адаа ot Bt al кх) (VI-64) 


and 


2 
to E Katt Kt shad +H a tact (У1-65) 


5 M: set the above two formulae equal to zero and solve for Хш and ssi, we 
obtain ; 


= RG. Xa. EX.) — 2X... R(X... = 
Хы KR-X2W-D + EC MC. тшш 


and 


Xm- Ks... +X a. E X.) — 2X... R(X). 55 MES 
2 k(k — 2)°/(k—1) д. КОХ atte 2X... (VI-67) 
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The above formulae may be used to estimate two missing values in the same 
row or in the same column merely by changing the subscript designations. 
Also, the same procedure may be used to obtain formulae for other combina- ` 
tions or more missing values. For example, suppose that the two missing values 
are for different rows, columns, and treatments and that the two values to be 
estimated are Ñu and Хз. If the above procedure of minimizing the residual 
sum of squares is followed, it will be found that the values of Xin and Xo are 
given by the formulae, 
, те R(X... + Xa. + X.a) — 2X... 
k(k — 3)(R? — ЗЕ + 4)/(k — 1)(k — 2) 


= um + Ха. XA) = ue (VI-68) 
j k(k — 3) — 3k + 4) À 
an 


row k(Xs.. + Xa. БХ.) — 2X... 
2 = E(k — 3) — 3k + 4)/(k — D(& — 2) 


1 Ета ташу = 2х... (VI-69) 
kk—3)EB-—3k-c4 ? 

No loss in generality is suffered by the procedure outlined above, since 

formulae (VI-68) and (VI-69) may be used to estimate missing values for 

different items of the three categories, rows, columns, and treatments. 


CHAPTER VII 


The Choice of Treatments and the Factorial 
Experiment—p" Series 


ҮП-1 Selection of Treatments and 
Treatment Combinations 


The foregoing chapters deal with the construction, layout, and statistical 
analysis of some of the simpler and more widely used experimental or observa- 
tional designs and with plot or pen techniques. The merits and faults of each 
of the designs are discussed. Before proceeding to more complicated experi- 
mental designs, it is deemed desirable to discuss the concepts related to the 
design and to the analysis of “factorial” experiments. The material developed 
in the present chapter is used to a considerable extent in the following chapters. 

In many experiments, success or failure may depend more upou the 
selection of treatments for comparisons to be made than upon the design. 
The selection of both the design and of the treatments is important, and 
neither should be slighted in planning the experiment. For example, the 
design of the experiment might be a latin square and the treatments new 
methods or new varieties ign may be quite appropriate, but the selec- 
only new methods or varieties affording 


» the proper choice of treatments 
1се of design, such as a systematic, might 


Other examples of the selection of treatments could be cited but the only 
one that merits considerable d; i 


erable discussion is the group of treatments involving 


two or more levels or kinds of two or more substances or factors in combina- 


tions; a few such examples are 


(i) light, oxygen, and temperature, 
(ii) temperature and length of Storage, 
(ii) spacings and rates of planting, 
(iv) levels of two or more fertilizers, 
(у) levels of ingredients and methods of mixing 
(vi) levels of ingredients and baking temperatures 

(vii) length and types of mixings, í 
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(viii) levels of proteins and carbohydrates in feeding trials, 

(ix) levels of insecticides or fungicides and varieties or species, 
(x) methods of teaching and schools, 

(xi) teaching methods and age of students, 

(xii) stores and price premiums, etc. 

А group of treatments which contains two or more levels of two or more 
factors or substances in all combinations is known as a factorial arrangement. 
As indicated by the various types of faclorials listed above, the factorial 
arrangement of treatments covers a wide variety of experiments. 

For the successful conclusion of an experiment, several items, of which 
the choice of the treatments and of the design represent only one phase in 
the planning of an experiment, are of considerable importance. Any experi- 
ment may be planned satisfactorily, but many things can go wrong in con- 
ducting the experiment: the experimental site might be located in the path of 
floods or made variable in other ways; the measurements could be recorded 
unreliably, thereby vitiating the results; a poor choice of size and shape of 
experimental unit and replicate and number of replicates may have bearing 
on the successful conclusion of an experiment, etc. (see Chapter I). However, 
it will be assumed that the other elements of scientific experimentation have 
been considered and that a factorial experiment has been decided upon. 
Some general concepts and definitions related to factorial arrangements and 
some alternative procedures will be cited prior to the discussion of specific 
factorial experiments and illustrative examples. 


VII-2 The Factorial Experiment 


The factorial experiment was called the “complex experiment” prior to 
1926 when Fisher [125] designated it as the factorial experiment. Yates [319, 
324] followed this notation, and at present an experiment containing all 
combinations of several levels of several factors is known almost exclusively as 
a factorial experiment. Yates [319] states that the complex or factorial experi- 
ment has been used on wheat experiments at Broadbalk since 1843-44 and 
on barley experiments at Hoosfield since 1852. Although Fisher [125] is 
primarily responsible for the development and analysis of factorial experi- 
‚ ments, Yates [319, 324] deserves no small amount of the credit for extending 
the development and for obtaining the analysis for factorial experiments. 
The classic work on factorial designs in complete block and in incomplete 
block designs is the pamphlet written by Yates [324] in 1937 and entitled 
“The design and analysis of factorial experiments.” Except for some notational 
deviations, the definitions, concepts, and analysis used in the present text 
follow those of Yates. à i 
VII-2.1 DEFINITIONS OF MAIN EFFECTS AND INTERACTIONS 
sign and analysis of factorial experiments, 


Before proceeding to the de t 
^ : d some facts are noted. First, the factorial 


Some definitions are set forth an 
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experiment, in itself, is not considered as an experimental design. Any of the 
experimental designs discussed thus far, or others, may be used for the fac- 
torial experiment. The choice of treatments, the "treatment design," deter- 
mines whether or not the experiment is a factorial. However, the construction 
and analysis of factorial experiments are extremely useful in the design and 
analysis of certain incomplete block designs. Second, the factors are designated 
by lower case letters a, b, с, etc. The particular level of a factor is denoted 
by the lower case letter with a subscript аз, bj, сь, etc. The particular combi- 
nation of the various factors is denoted by either symbol ajb;c;- ++ or i hes, 
The latter designation is the one most commonly used in the remainder of the 
text. Third, comparisons among the combinations (treatments) in the factorial 
experiment are denoted by capital letters A, B, A x B, C, etc. These symbols 
denote the effects. The effects in a factorial experiment are composed of 
main effects and interactions. A main effect, say A, represents a set of p — 1 
orthogonal single-degree-of-freedom contrasts among the p totals, say (A); 
of a given factor, say a. The levels of an effect are designated by the symbols 
(A)o, (А), ***, (A)p-1. The parentheses around the letter denote that the 
level of the effect, say (A);, is a total of all experimental units contributing 
to the ith level of the effect: the symbol A; denotes that the level of the effect 
is on an experimental unit basis. Likewise, the symbol (A) denotes that the 
effect is on a total basis, whereas the symbol A denotes that the effect is on an 
experimental unit basis. Yates [324] uses square brackets instead of paren- 
theses to denote that the effect is on a total basis; thus, (A) = [A]. Fourth, 
all the units are used to evaluate main effects and interactions, Fifth, the 
lowest or zero level of a factor need not be zero or none but is the lowest level 
of the factor considered pertinent to the experiment; e.g., 50 pounds equals 
zero or the lowest level and 150 pounds equals one or the highest level of a 
factor. Last, the interaction of factors needs to be defined both by words and 
with symbols. 


The interaction of two factors is the failure of the levels of one factor, 


‚ зау bo, is a measure of 
the interaction of the two 
factors a and b at zero and one levels is (AB) = (qb, — abı) — (aibo — aobo). 
nteraction of two factors each at two levels is 


presented in figure VII-1. Zero interaction would be obtained if the yields for 
the factor bo were as pictured by the broken line.! 


The interaction of two factors each at three levels is pictured in figure 


Other types of interaction are i i ity i i 
sec. 50-53; 256]), but these are not diee we, (°-8 quantity X quality interactions [126, 


scussed in the present, text [also, see 311]. 
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Yield —> 
Yield —э~ 


а а 92 


Л Figure VII-1. Graphical represen- Figure VII-2. Graphical representation of a 
tation of a two-factor interaction for two-factor interaction for a 3 X 3 factorial. 
а 2 X 2 factorial. 


VII-2. Zero interaction of the two factors is obtained when the graphical 
array of yields is as pictured by the broken lines. The graph for the broken 
lines is obtained by plotting the expected values for zero interaction as per 
the following table: 


D l mur s s 
b: 


m b 
ао x х+у x-dz 
ai x+u x+y+u x+z+u 


8: xdv KEYEN x+z+V 


The sum of Squares of the deviations of observed values from the above 
values is the sum of squares due to interaction. The resulting mean square 
15 then compared with the error mean square to determine whether or not 
there is an interaction of the factors. 
. The interaction of three factors, say a, b, and c, is illustrated graphically 
In figure VII-3. If the graph for c, followed the broken lines, there would 
© a zero three-factor interaction. As illustrated, a nonsignificant (or signifi- 
cant) three-factor interaction does not require that the two-factor interactions 
© Nonsignificant (or significant) but that the interaction of two factors be 
the same for all levels of the third factor (see problem VII-3). 
The above definition of an interaction allows interchange of any of the 
letters їп the graph. The interaction of a with b, or A X B, is the same as 
the interaction of b with a, or B X A. Likewise, the interaction of three 
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factors represents the failure for a two-factor interaction, say A X B, to 
remain relatively constant at all levels of the third factor, say c; also, the 
interaction of three factors is the failure for A X C to remain relatively the 
same for all levels of b. 


Yield —> 
Yield —> 


ч 


o а а; ао а 


Fi 4 M х i + . H D D 
More IL-3. Graphical representation of a three-factor interaction for a 3 X 3 X 2 


я Ше D A X B is used to denote the interaction of two factors and 
54 X to denote the interaction of. three factors. These interactions may 

be partitioned into component parts, According to a convenient symbolism, | 

which is introduced later, AB represents one part of the A x B interaction 

when the factors are present at more than two levels Likewise 

i interactions may be partitioned into component, parts 

tak zi) two-factor interaction Sum of squares шау be obtained from the 


сї (ixx Oi 3 (VII) 
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where j = 1, 2, «++, p; h = 1, 2, «+s, q; the factor a is at p levels, the factor 
b is at q levels, and there are r replicates on each combination. The dot in- 
dicates summation over the subscript it replaces. 

The three-factor interaction is obtained from the formula, 
Xl Xia NEG d. 


1 TT Т ЕТ Nu 
XXIe hq ^ kp + pq k q p ko 


1 wx "eel 1 1 
a bon РӘ Ухх. ауз 


mt a 1 ix =. Gok Ў 
rp ae - УУХ. Ер h (VIL 2) 


where g = 1, 2, +++, k levels of factor c and the other quantities are as de- 
fined above. Four- and higher-factor interactions may be obtained from an 
extended form of equations (VII-1) and (VII-2) or by alternative procedures. 
F urther discussions on the formation, calculation, and interpretation of various 
Interactions are given later for specific factorials and examples. - 


MENG ADVANTAGES AND DISADVANTAGES OF FACTORIAL ARRANGE- 

As stated at the beginning of this section, all combinations are used to 
evaluate main effects and interactions in a factorial experiment. This is not 
true for other arrangements of treatments. Therefore, there is some loss in 
information in nonfactorial arrangements owing to the fact that only a frac- 
tion of the total number of experimental units is used to evaluate an effect. 
The percentage of the total number of experimental units used to evaluate 
an effect depends upon the composition of treatments in a nonfactorial experi- 
ment. In addition, the main effects in a factorial arrangement are evaluated 
9Yer a wider range of conditions, and information is obtained on the inter- 
action of factors, In fact, if the objective of the experiment is to estimate main 
effects ang interactions, a factorial arrangement of treatments (either a 
Complete factoria] or an appropriate subset) is optimum for this purpose [175, 
E: 425]. A factorial experiment allows description of the response pattern over 
à Prescribed surface, If the objective of the experiment is to describe a par- 
ticular point on the response surface, a nonfactorial set of treatments may be 
more aPpropriate than a factorial set (see section VII-3). 

„ 16 use of all experimental units in evaluating an effect increases the 
еШсїепсу of the particular experiment. The concept of “hidden replication" 
aS opposed to “absolute replication” [126, sec. 40] or the repetition of the set 
d, reatments should be noted. Fisher [126, sec. 40] and Yates [324] state 
fidei 18 often desirable to include additional factors in an experiment in 
Es s © observe the effects of interest over a wider range of conditions. Аз 
aif ре of this, ап experimenter might include dates of planting or 

Ping as an additional factor in a variety yield trial to measure, to some 


12 
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extent, seasonal variations. Also, several technicians instead of a single one 
may be used, or animals from several strains may be included in the experi- 
ment in order to cover a wider range of conditions for the treatments being 
compared. Too often this fact is overlooked in experimental work, and the 
different “absolute” replicates are too much alike to make inferences of the 
desired nature. However, for experiments on the standardization of tech- 
niques, it may be desirable to limit the number of subsidiary factors in the 
preliminary stages of the study. 

Factorial experiments result in unbiased conclusions even if trends or 
gradients are present in the experiment, although their effectiveness is some- 
what reduced [102, Ch. 13]. Worker fatigue, temperature, light, and humidity 
changes, gradients in material, certain types of interactions, etc., account for 
trends in experimental material. 


Briefly then, the advantages of a factorial experiment may be summarized 
as follows: 


(i) all experimental units are utilized in evaluating effects, resulting in the most 
efficient use of resources; 


(ii) the effects are evaluated over a wider range of conditions with the minimum 
outlay of resources; 


(iii) an estimate of the interaction of the factors is obtainable; 


(iv) unbiased estimates of effects are obtained whether or not time trends are 
present; 


(v) a factorial set of treatments is optimum for estimating main effects and 
interactions. 


The so-called disadvantages of factorial arrangements of treatment com- 
binations are often more superficial than real [60; 126; 102, Ch. 13; 324]. 
The factorial arrangement may turn out to be an inefficient procedure to obtain 
the answer for a specific question. However, information on a number of 
effects and interactions is usually preferable to information upon a specific 
combination. The objectives of the experiment will determine whether or not 
a factorial arrangement of the treatments is desirable. 

A One real disadvantage of factorial arrangements is the large and prohibi- 
tive number of combinations necessary to study several factors at several 
е For example, a factorial arrangement of seven factors each at three 
pec d combinations, and a factorial arrangement of ten factors 

requires 1024 combinations. The scheme known as “frac- 


tion ication” 
ла replication (see Chapter IX) greatly reduces the number of combi- 
(es required to estimate certain effects. 
5800) i 
some RPEN with the large number of treatment combinations required in 
ae ав is the decrease in the efficiency of an experiment 
ат e edm d the replicate. This difficulty may be overcome 
} о . H “+ V 
roues ihe RE ре unimportant effects with “incomplete blocks 


ck contains only a portion of the total number 


8 VII-4.1) The 2 X 2 Factorial 173 


of treatment combinations and, therefore, does not include as much of the ex- 
perimental material as the complete block. With confounding, the complexity 
of computations increases; the extent to which this is a disadvantage depends 
upon the complexity of the analysis and the available computing facilities. 


VII-2.3 CHOICE OF LEVELS 

The choice of levels of factors to be used in an experiment depends upon 
the nature of the experimental yields and upon the objectives of the experi- 
ment [82, 268, 271]. The form of the response curve and the portion of the 
curve studied determine the number and location of the levels to be investi- 
gated. If the experimenter knows the range in levels of interest and if he desires 
to investigate the form of the response curve, he should select as many levels 
as are practical. The selected levels may or may not be equally spaced. To 
facilitate the computations of curvilinear responses of levels [273] of an 
effect, it is suggested that equally spaced levels be chosen; i.e., one unit, two 
units, three units, etc., or k? units, &! units, k? units, etc. The latter levels, · 
k, are equidistant on a logarithmic scale. If the range in which the factor 
is effective is unknown, but the lower (or the upper) limit is known, then it is 
Suggested that the first level be Ck®°, the second level be СЁ!, the third level 
be Се, etc. where C is the amount used for the first level; with such a scheme 
à wide range is covered with a limited number of levels. 


VII-3 Procedures to Investigate Specific Objectives 


A number of procedures have been developed to investigate specific areas 
or points on the response surface. If the objective of the experiment is to 
estimate the concentration of a drug yielding a specified percentage of suc- 
Cesses or to estimate the combination of two factors giving the maximum (or 
minimum) yield, specific procedures resulting in a nonfactorial selection of 
treatments are available. The probit technique method is used extensively to 
estimate the concentration of a drug necessary to obtain a given percentage 
of success [23; 24; 126, sec. 69]. Alternative procedures for this problem have 
been proposed by Dixon and Mood [91] and by Robbins and Munro [257]. 
Procedures for estimating a maximum have been described by a number of 
соога [102, Ch. 13; 161; 182]. The procedure developed by Kiefer and Wol- 
OwItz [182] is operationally simple and tends to concentrate the combinations 
of factors used near the combination giving the maximum yield. А more gen- 
eral approach for studying the entire response surface has been presented 

Y Box et al. [35, 36]. 


ҮП-4 The Factorial Experiment—2^ Series 

VII-4.1 THE 2 x 2 FACTORIAL 

AC peel some experimenters may prefer to think of the effects and inter- 

it has [m terms of the I, J, W, X, Y, Z, etc. effects outlined by Yates [324], 
n found that the modulo notation [29, 30, 173, 175, 177] is extremely 
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useful for the more complex factorial experiments and for confounding certain 
effects with block differences (see following chapters). The method has been 
found to be useful in the design and analysis of incomplete block designs [177]. 
The use of modulo notation in the present text is confined to prime numbers 
(2, 3, 5, 7, 11, 13, 17, etc.) or powers of prime numbers. The system used does 
not work for nonprime numbers in that the effects so estimated are partially 
confounded. For modulo p the following relationship among numbers exists: 


0=р=2р=... = @р; 
l=p+1=2p+1=---=qp+1; 
2=p+2=2%p+2=-++-=qn+2; 


. 


p-l-2p-l1-3p—-1-2-..—gp-tp-—1. 


The subscripts of the factors a, b, c, etc. are i, j, h, etc., where i = 0, 
l,e p—1;j= 0,1, ---p—1;h- 0,1, <+., p — 1; etc. For a р" = 2? 
factorial the levels of the main effects and interactions are composed of the 
following treatments: 

(А); = aobo + аб, = 00 + 01, 
(А) з = а + abı = 10 + 11, 
(В); = ab, + abo =00+ 10, 
(B);-1 = ab + abı = 01 + 11, vara) 
(AB) о = aobo + аһ, = 00 + 11, 
(AB) ix = adi + аЬ = 01+ 10, 
and the effects and interactions are 


M= Ui (s (10+11)—(0+01); А = п1о--11— 00—01); (VII-4) 


(=~ B= 01-11) - (00-10); B= 11.111. 99. 30), (VII-5) 
AB) = (AB) )— = 

(4B) = (АВ), (АВ), = (0--1)) — (01 +10); AB = (01-01-10), (VII-6) 
where r — the number of rep 
placed by treatment totals, 
as the difference in the two 
the second factor; thus: 


(AB) 


licates and thi 
The two-factor i 
levels o; 


€ treatment designations are re- 
nteraction effect, AB is estimated 
Ї one factor compared at the two levels of 


[bi TH bo] in the presence of q— = 
b i do 
ЖЗ AM 1 — [bı — by] in the presence of 
11 + 00 — 10 — 91. 


ll 


(VII-7) 


§ VII-4.1] The 2 X 2 Factorial 115 
Also, the interaction АВ is equal to the interaction ВА: 
(BA) = [а, — a] in the presence of b; — [а — ав] in the presence of bo 
= 11 + 00 — 01 — 10 = (AB) = (AB), — (AB). (VII-8) 
The above comparisons correspond to the table of plus and minus signs 


used prevalently in statistical references [60, 273, 324]; 


re Cee eee ERR 
Combination of treatments 


Effect abo = 1 aibo = a ab; = b ab, = ab 
(A) - + - + 
(В) - - * T 
" B) + - - + 
©) __ +. „= и ИЛОЧИ МНА 
Total + + + + 


Thus, (A) = —ab + аЬ — abi + abı = 10 — 00 + 11 — 01, etc. 

Yates [324] and Fisher [126, sec. 43] present a set of algebraic quantities 
for obtaining the signs in the above table. They define the factor levels as 
follows: ay = 1, а, = a, bo = 1, b; = b, and aobo = 1, aibo = а, аф = b, and 
аф, = ab, Then, the following algebraic quantities give the signs for the 
effects in the above table: 


(A) = (а= 1)(b --1) =a-1+ab—b 
(B) = (a4-1(b—1 2b—1-4ab—a f. (VII-9) 
(AB) = (a — 1(b—1) =ab+ 1 — a — b 
Another (equivalent) procedure for obtaining the coefficients for the interac- 
tion AB is to multiply the coefficients for A by the corresponding ones for B. 


E The sums of squares for the main effects and interaction from r replicates 
re 


[4 = (Ax. Xn b Xa — Xe — Ха) (vio) 


4r гр +1 + (1) + (097 
((B) — (By? (Ха T. Xa — Xo = Xa? Ж 
4r SU DI umen 2 ilb 
and 
КАВ) — (АВ? _ QCu Xo — Xa — Xo (уә 


4r ТЕЕ тве аа? 

ү X00, Xon X40, and Ж. are the totals for treatments 00, 01, 10, and 

> respectively, The replicate and error sum of squares are obtained in the 
Usual manner, 

Ae Andividual errors may be computed for each comparison; i.e., the inter- 

E of the three effects A, B, and AB with replicates yield three error 

Seach with r І degrees of freedom [273, sec. 15.7]. Usually the pooled 
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error is used in an experiment, since the individual error variances are consid- 
ered to be estimates of the same residual error variance s. 

From the above table of plus and minus signs, it should be noted that the 
treatment totals may be obtained from the effect totals. This fact is impor- 
tant in the analysis of experiments in which some of the effects are confounded. 
The various totals may be obtained from the following formulae: 


Xo = =(4)— (B) + (AB) + X... 


Xn = A= 08) == 3E X... 


Xa = =A) (P) — (AB) + X... 


, 


and 
See (A) + (B) + (AB) + X... 
o е 4 , 
where X... is the total for the experiment. In general, 
X, ШКА) + (DB) + (-D*(AB) + X... 
drca 4 
_ (4): + SE (AB) | =з (VIL.13) 
since 


(A) = (A): — (А), = 2[(А), — X.../2] 
= 2[X.../2 — (А)»], etc. 


Example VII-1. Naik [214] studied the effect of various types of budding on citrus 
trees. The particular example chosen to illustrate the analysis for a 2 X 2 factorial 
represents two amounts of wood on the bud of acid lime scions in combination with 


two times of cutting the Kichili orange rootstock arranged in a randomized complete 
block design. The four treatments are 


00 = bud inserted with a thin slice of wood adhering to it. The rootstock seedling 
was lopped off about 3 inches above the bud insertion after the bud had 
grown 2 inches or more. 

01 = bud inserted without wood with the same treatment on the rootstock seedling _ 
as for treatment 00. р 

10 = bud inserted with thin slice of wood adhering to it. The rootstock seedling 

was cut off immediately after the bud insertion. 


1l — the same bud treatment as for 01 and the same rootstock treatment as for 10. 


\ The data presented in table VII-1 are the percentages of success of budding acid 
lime buds on Kichili orange rootstocks. It will be assumed that all percentages are 
derived from an equal number of trials! and that the value for treatment 11 in block 
e percentages are derived 


‚ Uf th from unequal numbers of observations Cochran [51] 
discusses th 


e procedure to follow in computing the analysis of variance. 
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I represents an actual percentage and not a missing plot value; at the conclusion 
of the present analysis, this value will be estimated and the analysis carried through 
with the estimated value in the analysis. Also, the percentage data were rounded to 
whole numbers. The arcsine transformation on the percentages was used [273, table 


TABLE VII-I. Percentage “bud-take” of acid lime scion on Kichili orange rootstocks; 
tables of effects and the analysis of variance 


X’ = percent; X = sin? VX’ 
^X" taken equal to 100/4(20) = 1.25% 


Effects 


Treatment totals 


Xoo7 X у aT 


.00 ОЛ 
232.49 105.96 158.05 84.27 


B = Wood vs no wood 
А = late vs early 


lopping 
Interaction 


Total 


Block 


208.14 


Treatment 3213.6} 
В = Wood vs no wood on buà 2265.1} 
= Late vs early lopping 704.24 
AB = Interaction 246.25 


Sra 527.73 


a 3949.51 
Orrection for mean 1 20359.72 


T 


Total 
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16.8; 102, Ch. 16]. Bartlett suggested that the value 100/2n [12] or the value 100/4n 
[14] be substituted for zero percentage. The latter value was added to zero before 
making the transformation. 

The totals, effects, and the analysis of variance are presented in table VII-1. The 
coefficients in the table of effects are those given by equation (VII-9) multiplied by 
minus one for the A and B effects. The coefficients for the interaction are the same. 

The A effect represents the comparison of early and late lopping or cutting of 
the rootstock seedling above the bud insertion; the total A effect is (А) = (A) — (А): 
= (232.49 + 105.96) — (148.03 + 84.27) = 338.45 — 232.30 = 106.15. The effect of 
including wood or no wood with the bud is (В) = (B)o — (В), = 380.52 — 190.23 
= 190.29. The interaction is calculated in a similar manner. It should be pointed out 
that (A) could have been defined as (A): — (A)o, and then the effect would have a 
negative sign. Likewise, the treatments could have been redesignated, resulting in a 
positive A effect. Since the designation is arbitrary, no confusion should result if the 
main effects are the negative of those given by equation (VII-9). The interpretation 
is made with this in mind, 

The replicate, treatment, and error sums of s 


quares are computed in the usual 
manner for a randomized compl 


ete block design. The treatment sum of squares, 
232.49? + 105.962 48.03? „21% 15? 
T 9 + 1 + 84.27 4 51616 = 3213.64, 


may be partitioned into sums of Squares with individual degrees of freedom (equations 
(VIT-10) to (VII-12)). For these data the sums of squares due to the various compari- 
sons are 


Wood vs no wood on the bud [equation (VII-11)]: 


(190.29)? e А 
q+1+14D) = 2263.14 with one degree of freedom. 


Lale vs early lopping of rootstock seedling [equation (VII-10)]: 


Wee = 704.24 with one degree of freedom. 


Interaction of time of pruning and amount of wood [equation (ҮП-12)]: 


2 
x = 246.25 with one degree of freedom. 


The sum of the three individual sums of squares 2263.14 + 704.94 + 246.25 = 3213.63, 
with 1 + 1-Е1 = 3 degrees of freedom, is equal to the treatment. BAD. аага 
within rounding errors, 
e use F to test the hypothesis of zero effects and reject at the 5 per cent level; 
_ 2263.14 
F= 58.637 3860 > Fo(1, 9df), 
704.24 у 
Е = = 
рУ 58.637 = 12.01 > Fo(1, 9df), 
Е — 246.25 


58.637 = 420 < Fos(1 9df) — 5.12. 
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Alternatively, we could use the £ test to test the same hypotheses. It is simpler to 
compute a least significant difference for the effect totals given in the central portion 
of table VII-1 than to compute the various F values. The standard error of these effect 
totals is 


з = Vere? = 58.637(4)(1 +1 +1+ 1) = 30.63, 


and the significant difference at the 5 per cent level is fos(9df)s¢ = 2.262(30.63) = 69.29; 
the с; are the coefficients for the particular comparison. As before, the A and B effects 


TABLE VII-2. Effects and analysis of variance for data of table VII-1 when the value for 
treatment 11 in block I is estimated 


Effects 


Treatment totals 


158.05 100.86 


338.45 
380.52 
555.55 
587.34 


as 1 
Ёггог ge 
Correction for mean 1 

"One value estimated from the data. 


exceed the lsd, while the interaction effect does not. The confidence intervals on the 

effects are computed from the effect totals and the lsd. 

for At the beginning of the discussion of this example, it was assumed that the value 

th treatment 11 in block I was zero. Instead of making this assumption, suppose 

Ey 2. assume that the datum for this particular experimental unit was accidentally 

in Ch ince the design is a randomized complete block design, the missing plot technique 
apter V may be employed to estimate this value: 


Жы = 402142 — 6.42) + 4(84.27 — 642) — (570.75 — 6.42) _ 23.01 
4—1)4—1) be 
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The value 6.42 is subtracted from each category to obtain the totals without the value 
6.42. We insert the value 23.01 in the table of yields and compute the effects and analy- 
sis of variance as given in table VII-2. The block I total now becomes 138.01, the total 
for treatment 11 = X. becomes 100.86, and the grand total equals 587.34. All effects 
exceed the F value at the 5 per cent level. It should be remembered that these tests 
are slightly biased (see Chapter V). 

The computation of a significant difference for the effect totals presents some 
difficulties, since the contrasts for the effects are no longer independent. The yariance 
of a difference between a treatment mean, say 2.11, with a missing yield, say Xiu, and 
any other treatment mean, say 2.00 is: 


А а АСЕИН 2(—1) 
У(Х. + Xu)/(r — .00)] = "Les 5 A eme 


232 v 
fassen e 


The variance of a difference between two treatment means with no missing yields, say 
Z.oo and Z.10, is: 


VOCE ыў a +1- 20} = 203. (VII-15) 
The estimated variance for the main effect total for A = 338.45 is computed as: 
24283 ron Е 2(—1— 1 — 1) 
"ә дү =; п 001) — 20+0+ 0) 


Е pen Oe m ; 
= 16066206 + 36 + 36 = 3 = 20(46.62) = 932.40; 
the least significant difference at the 5 per cent level is: 


2.306 V 932.40 = 70.41. 
The total effects for B and AB have the same least significant difference. 
The means from tables VII-1 and VII-2 are presented graphically in figure VII-4. 
The responses are presented graphically if the interaction mean Square is significantly 


2l 250 
| 

200r 200 
z 150 by = 150 by 
2 E 
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Figure VII-4. Graphical presentation of totals from tables VII-1 and VII-2. 
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larger than the error mean square or if it is desirable to present a pictorial representa- 
tion of the response. 


VII-4.2 THE 2° FACTORIAL 


2^ series is the 23, which involves 


The next factorial arrangement in the 
Js in all combinations. The eight 


three factors, a, b, and c, each at two leve 
treatments are 
000, 100, 010, 110, 001, 101, 011, 111, 


and the levels of the main effects and interactions are obtained from the 


following combination of treatments: 


(А). = 000 + 010 + 001 + 011; 
(A) int = 100 + 110 + 101 Je quts 
(В) = 000 + 100 + 001 + 101; 
(В) = 010 + 110 + 0H +11: 
(AB) о-о = 000 + 110 +001 + 111; 
(AB)izjar = 100 + 010 + 101 + 011; 
(С) nwo = 000 + 100 + 010 + 110; (VII-16) 
(G) ner = 001 + 101 + 011 + 111; 
+ 010 + 101 + 111; 


(ABC) һо = 
(ABO) ai = 1 


The main effects and interactions are the contrasts of two sums; thus: 


(A) = (A): — (A4 = ias = (у: 


(В) = (В), — (Bw B = Læ n Bx}: 
; (VII-17 
(AB) = (АВ) — (АВ): AB = 2 (AB) = (Abd: ) 


(ABC) = (АВО) — (АВС); ABC = 1 (аво), = аво»), 


Where the levels of the effects аге summed over the r replicates. 
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In a like manner the effects may be obtained from the following table: 


Treatment combinations and totals 


Effect 000 010 110 001 101 011 111 
«000 010 X 110 Xo Xaia Xon X n 

Total + + + + + + + 
A = E. at E i = + 

B - E = - - + + 

АВ + - + + = = + 

c = = - t 4 + + 

АС + + - 2 H - + 

BC + a E. = E ae + 

ABC = + = + = - + 


where the totals of each column are added or subtracted as indicated. For 
example, 


(AC) = X00 — X.» + Xoo — X.no — X. + Хаа — Xoo + Х ап 
= (А0), — (AC). (VII-18) 
The sum of squares for AC is 


КАС), — (АС): 
QTITIETITITITIEYIy (VII-1) 


where r equals the number of replicates involved in obtaining the totals 
X -ijn The remaining sums of squares for the effects are obtained similarly. 
A third computational method for obtaining the main effects and inter- 


actions has been presented by Yates [324]. The steps are presented in the 
following table: 


Treatment | Yield Col. 1 Col. 2 Col. 3 


Effect total 


ie ee | se | ie 
000 Х | Хо +Xaw=s, | s +s | з +5 +s +s | Total = Хх... 
100 Хе» | Xow + Xa = & | Sts: | sstssts;+s, (A) 

010 X.no Ха + Xan =з Ss + Se S2 — Sı + 8; — 83 (B) 

110 Хо Xon + Xan = 5; 8; + Ss 56 — Ss + 58 — 8; (AB) 

01 ы ш == ж = 85 52 — Sı 5з +s — Sı — 82 (С) 

101 “110 — 4.019 = S 5$ — 5 с. ны 
i xm val vua 3 4 3 87 + Ss — Ss — 5 (AC) 


Pm xn 57 56 — S5 Si — S: — 82 + 5 (BC) 


The sum of the item: 
the last column. Th 
used as in the two p 


s in a row of column 3 yields the effect totals listed in 
© same combinations of treatments for the effects are 
receding computational methods. 
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The plus and minus signs in the above table may be obtained from the 
following algebraic quantities [126, sec. 43; 324, p. 12]: 


(A) = (a—1) (6+) (c4-1) = —1-2-a— b--ab— c--ac— bc-- abc 
(B) = (a--1)(b—1)(c4-1) = —1 —a+b+ab—c—ac+be+abe 
(AB) = (а—1)(0—1)(с+1) = 1—a-—b--ab-4-c— ac — bc-- abc 


(VII-20) 


(АС) = (а—1)(6+1)(с—1) = 1 —a4-b — ab — c-4-ac — bc-- abc 


(ABO) = (a —1)(b—1)(c—1) = —1+a+b—ab+c—ac—be-+abe, 
where 1 = X.w, а = Хш b= Xow, ab = X.no c = Xan, ac = Хао, 
be = Х.оц, and abe = Хоц. 

The standard error for any effect total is ~/rs*(1+1+1+1+1+1+1+1) 
= 92s4/2r, where s? equals the error mean square for the particular experimental 
design used. The standard error for a mean effect is s/ мг. 

The treatment totals may be obtained from the effects or from the levels 
of the effects in a manner similar to that for the 2 X 2 factorial; 


Х. = M. + (-D9(4) + (—1)^(В) + (AB) + (OGON 
+ (AC)(—1)#* + (ВС)(—1)+* + (ABO) (-D 9 


= Hax + (B); + (АВ): + (Oo + (АС) + (Вб) 


+ (АВС) дь — 3X... (VII-21) 

Example VII-2. An experiment was conducted to study the effectiveness, use, 
and preparation of stereophotographic aids in engineering education [282]. Only a’ 
portion of the experiment is used to illustrate the analysis for a 2° factorial. The part 
of the experiment reproduced (table VII-3) is on the portion dealing with non-stereo- 
photographs. 

In setting up the experiment, Straub [282] obtained twenty-four senior and twenty- 
four (actually twenty-eight, but four were excluded at random in order to have equal 
numbers in the analysis) freshman students. In each class the students were divided 
into four groups of six students each. The groups were “equated” or “balanced” on 
the basis of six characteristics. There were no apparent relationships (all values of 
the correlation coefficient are not significant at the 5 per cent level) between any of 
the six characters used for equating the groups and the percentage of correct answers. 
Hence, for our purposes, it is assumed that the six students to receive each treatment 
е design is a completely randomized one. The practice 


were randomly selected and th : 
of “equating” or “balancing” the subgroups is not a recommended procedure for the 


reasons set forth in Chapter I. 
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The three factors are class = c, picture and question set — b, and position of 
exposure — a. The freshman group is designated as co.and the seniors аз с. The bo 
group comprised a set of pictures on which twenty-seven questions were asked, and 
the b; group was composed of another set of pictures upon which eighteen questions 


TABLE VII-3. Original and transformed data for the 2 factorial 


161.8 184.2 211.8 
30.30 33.12 30.70 35.50 


4 questions answered correctly; W = number of questions answered 
incorrectly. 
= 1.39% [12, 14, 102, Ch. 16]. 


Total main effects and interactions for transformed data 


Х ооо Хуу X ig Xo Xo 
203.4 175.0 179.0 181.8 198.7 


Xam Xon Xan 
184.2 211.8 256.8 


= + = + 
= + - 

- с + $ x t z 3 
E e z * * - & + 
б; z c T. X n 

* - + к. a К + + 
+ + = = E: E ^ * 
= + + - + & $ M 
* + + + A T " t 
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The 2° = 8 treatments are 


000 = left exposure, photo and question set I, freshman; 
“ “ “ « К 


l00.——ight 4 „# 1, ; 
010 = left “ “ 4 “ “ TIS “ ; 
1lo=sight € s Seas LESE UICE Жа 
001 Jub © „ше 56 Ld * T, senior; 
101 = right “ р “ “ “ “ T “ : 
011 = left ere £ V CIR CC. n 
WL= right " , © Se Г 


Since the data are in two categories, numbers of questions answered correctly 
and incorrectly, it appears that the percentage of correct answers might be binomial 
in nature; this would indicate that an arcsine transformation of the data might be 
appropriate. The situation is complicated by the fact that the percentages are derived 
from unequal numbers [51]. The problem of appropriate weights presents some 
difficulties. In this connection, however, the only comparison derived from an unequal 
number of questions is the B effect. In order to make a start on this problem, the per- 
centage of correct answers was transformed to angles by the arcsine transformation 
(273, table 16.8]. The individual variances on each group of students on the eight 
treatments were computed (table VII-4). The chi-square test for homogeneity of 
variances [14; Chapter IV] was not unusually large (P = .3). However, the average 
variance, 127.9, for the four groups answering eighteen questions was significantly 
higher than the average variance, 47.1, for the remaining four groups which answered- 
twenty-seven questions. If binomial variance is assumed, the latter variance is much 
closer to its theoretical variance [51, 81; 102, Ch. 16], 821/27 = 30.4, than is the former 
one to its theoretical variance, 821/18 = 45.6. Owing to the fact that the variation 
among the individual subgroup variances was not unusually large, "and because the 
magnitudes of the variances for the same groups were reversed in an experiment on 
Stereo pictures (see problem VII-2), it will be assumed that the pooled within-group 
mean square may justifiably be used to compare the various contrasts. It is realized 
that more study of these data is required. If the errors going into the В contrast, say, 
аге considered to be different, a modification of the error degrees of freedom is neces- 
sary (Chapter V). 

The main effect and interaction totals in table VII-3 ai 
the plus values and subtracting the sum of the quantities wi 
(ҮП-17)). For example, 


(А) = (А), — (A)o = 797.8 —.792.9 = 49, У. 
(АВ) = (АВ), — (AB): = 840.1 — 750.0 = 9 2 
and (АВС) = (ABC) — (ABO) = 809.5 — 781.2 = 28.3. 
total, and within treatments 
randomized design (Chapter 


3 are obtained by summing 
th minus signs (equation 


The sums of squares in table VII-4 for Кү 
are computed i described for a comple 2 EH A 
IV) The teneret sem of squares Pid Tr degrees, е ere ote 
Seven independent sums of squares, each MILES Trot is 4.93/8(6) = 0.500 
9rmulae similar to (VII-19) the sum of squares for the А 90.72/48 = 171.385, ete. i 
ог the B effect is 68.12/48 = 96.617, for the AB effect 1s °° : 
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The various mean squares may be compared with the error mean square, 87.514, 
to test the hypothesis of zero effects. None of the effects exceeds the 5 per cent level 
of F. The class difference exceeds F at the 10 per cent level. However, this is about 
what would be expected in making multiple F tests; 10 per cent, or one of the effects, 
would be expected to exceed Fi, on the average. This point should be kept in mind 
when making F tests of a large number of hypotheses [see 134]. 

The standard error of an effect total is +/87.514(6)(8) = 64.81, and the significant 
difference at the 5 per cent level is 2.021(64.81) = 131.0, where 2.021 is the tabulated 


TABLE VII-4 Analysis of variance for transformed data from table VII-3 


Source of variation 


Treatment 
A = Exposure 
B = Set 
AB = Exposure x set 
С = Class 
AC = Class x exposure 


BC = Class x set 
ABC 


7 Class x set x exposure 
Anong boys within treatments 
Within 000 
100 
010 


193.34 
334.19 
887.57 
932.34 
208.27 
206.86 
531.34 
206.64 


\л \л \л \л \л \л м м 


Total 


Correction for mean 


Total uncorrected 


value of t with 40 d 
intervals are the е 


egrees of fre 


аш ffect totals pl 
tion is V/91.514/33 14 = 28 


edom at the 5 per cent level. The 95 per cent confidence 
us and minus the value 131.0. The coefficient of varia- 
? i iation, it might 
be profitable t Per cent. With such a high degree of variation, it mug 
are needed, AN STE of reducing the variation. Perhaps more questions 
he randomized 
PS did not 


Hence, others Deed to be Een аг to be of much value for stratification purposes. 
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if desired the treatment totals may be computed from the effects or the levels of 
various effects. Applying formula (VII-21), the total for treatment 001 is 


1 
X. = 4792.9 + 761.3 + 840.7 + 851.5 + 767.3 + 743.7 + 809.5 — 3(1590.7)] 


= 198.7. 


The other treatment totals may be computed similarly if desired. This method of 
obtaining treatment totals is used for incomplete block designs (see later chapters) 


but it is not ordinarily used to obtain treatment totals. 


VII-4.3 THE 2‘ FACTORIAL 
The 2 factorial arrangement of treatments involves four factors, a, b, 
c, and d, each at two levels. The sixteen treatments are designated as 


(1) = 0000 c = 0010 d — 0001 cd — 0011 
a — 1000 ac — 1010 ad — 1001 acd — 1011 
b — 0100 bc — 0110 bd — 0101 bcd — 0111 

ab — 1100 abc = 1110 abd — 1101 abcd — 1111. 


The letters are the symbols used for the various combinations by a number of 
authors [60, 273, 324]. The levels of the effects are equal to the following 


combination of treatments a;b;cd; = Uhf: 
(А); = 0000 + 0100 + 0010 + 0110 + 0001 + 0101 


+ 0011 + 0111; 
(А); = 1000 + 1100 + 1010 + 1110 + 1001 + 1101 


+ 1011 + 1111; 


(AC) ace = 0000 + 0100 + 1010 + 1110 + 0001 + 0101 


+ 1011 + 1111; 
(АС): л = 1000 + 1100 + 0010 + 0110 + 1001 + 1101 
+ 0011 + 0111; (VII-22) 


(BOD),, у. = 0000 + 1000 + 0110 + 1110 + 0101 + 1101 


+ 0011 + 1011; 
(вор), ууа = 0100 + 1100 + 0010 + 1010 + 0001 + 1001 
+ 0111 + 1111; 
(ABCD) ss, у = 0000 + 1100 + 1010 + 0110 + 1001 + 0101 
+ 0011 + 1111; 
(ABCD) ьа = 1000 + 0100 + 0010 + 1110 + 0001 + 1101 
+ 1011 + 0111. 


18 


188 Factorial Experiment—pn Series [8 VII-4.3 


An effect is obtained as the difference between the levels of an effect; thus: 


(A) = (4), — (A); А = {(4), — (A)o} /8r; 
(АВ) = (АВ), — (АВ); АВ = {(АВ), — (AB).]/8r; 
(VII-23) 


(ACD) = (ACD), — (ACD); ACD = (ACD)/8r; 


(ABCD) — (ABCD), — (АВСР),; ABCD = (ABCD)/8r. 


Alternatively, the effects may be obtained from a table of plus and minus 
signs (table VII-5). 

The fifteen treatment degrees of freedom may be partitioned into fifteen 
independent comparisons with single degrees of freedom; thus, for г replicates, 
this part of the analysis of variance is 


Source of Degrees of 

variation freedom Sum of squares 

A 1 [(А), — (A)o]?/16r 

B 1 [(B) — (В),)2/16г 
AB 1 [(AB)o — (AB);]*/16r 
c 1 С), — (С), ]2/16г 
АС 1 [(AC)o — (AC), ]/16г 
вс 1 [(BC), — (BC), }*/16r 
ABC 1 [(ABC), — (ABC)o]?/16r 
D 1 [(D) — (D)o}?/16r 
AD 1 [(AD)o — (AD); )2/16г 
BD 1 [(BD), — (BD), }?/16r 
ABD 1 [(АВР), — (ABD)o]?/16r 
Ор 1 [(CD), — (Ср) 2/16: 
ACD 1 [(ACD); — (ACD);?/16r 
BCD 1 [(BCD), — (BCD)o]?/16r 


m 


ABCD [(ABCD), — (ABCD), }?/16r 


is standard error for the effect totals in the right-hand column of table 
> uis ` rs?(16), where 5° is the experimental error variance. The total for 
icular treatment may be obtained from the effect totals or from the 


levels of the vari i 
1005 зс i i is simi 
idi (УП). main effects and interactions. The formula is similar to 


Factorials of the 2» 


ut Series f. i in a 
manner similar to that or п larger than four may be obtained i 


described for the 2°, 23, and 2! factorials. 


te 


+ + + + + + + + + + + + + + + + 


+ - - + + 


- + - + - * 


329JJ9 Utt. TTO y "oU. ттоо'у TOIT’ 
лој 


poqu Pq pow рә раз 


х тото’; TOOT" T000" y оту OTTO" y or. отоо" оотт'у ooto" 


ра 


pe 


P aqe aq оз ə аз а 


181107987 ,c В Jo syoaya [030] 207 Чу” 810 10J sjua: 


х ооо” Y 0000* 


328333 
= (1) 


90) "S-TIA ATAVL 
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VII-4.4 GENERALIZED INTERACTION IN THE 2» SERIES 
The concept of a generalized interaction has been discussed in detail by 

Yates [324] and by Fisher [126, sec. 45.1]. As illustrated earlier the inter- 
action of А and B is AB. Likewise, the interaction of А and BC is ABC, and 
of A and AB is B. The latter example A X AB = А?В = B, reduced modulo 
two, illustrates the fact that 0 and 1 are the only elements of this number 
system and that A? — A? — 1, since any number to the zero power is unity. 
Some further examples will illustrate the method and principles involved: 

ABC x BCD = AD; 

ABC x DEF = ABCDEF; 

ADE x AFG = DEFG; 

BDF X BCFG = CDG; 

BDF х ABC = ACDF; 

AB x BC = AC; 

C x BCDF — BDF; 

ABCD x ABC - D. 


The concept of generalized interactions is useful for clarifying the construc- 
tion of various incomplete block designs. 


VII-5 The Factorial Experiment —3" Series 


The 3^ series is constructed in much the same way as the 2”. There are 
n factors each at three levels in а 3” factorial arrangement. The number 
system here is 0 = 3 = 6 = ··..,1=4= 7 = ·.:, =5=8=..., 
ҮП-5.1 THE 3? FACTORIAL 


For the 3? factorial the factors a and b are at the 0, 1, and 2 levels and 
are in all possible combinations; this results in the nine treatments: 00, 01, 
02, 10, 11, 12, 20, 21, 22. The 0, 1, and 2 levels of effects А and B are 


(A)o = 00 + 01 + 02; 
(A): = 10 + 11 + 12; 
(4), = 20 + 21 + 22; o 
(B), = 00 + 10 + 20; 
(В). = 01 + 11 4+ 21; 
(B) = 02 + 12 + 22. 
g the three levels of the factors A or B from r replicates 
es with 2 degrees of freedom, 


(A)? + (A)? 2 
== _ а) + CE + (A)? (VII-25) 


The comparison amon; 
yields a sum of squar 
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or 


(B)? + (B)? + (B)? _ [(В» + (В) + GA". (VII-26) 
9r 


3r 


Among the nine treatments, there are 8 degrees of freedom; thus, there are 4 
degrees of freedom for the interaction of the factors a and b (denoted as 
A X B). It is possible to partition these 4 degrees of freedom into two sepa- 
rate portions, which, in conformity with the code given below, are denoted as 
AB( = J effect in Yates’ notation) and АВ = I effect in Yates’ notation). 
This partitioning may have little or no meaning experimentally but is im- 
portant in constructing incomplete block experiments. The three levels of 
AB and AB? are [324, p. 40] j 


(AB) i4j00 = 00 + 12 + 21 = [Ji]; 

(АВ): уа = 01 + 10 + 22 = [J2]; 

(АВ): = 02 + 11 + 20 = [5]; (VIL27) 
(AB?) о = 00 + 11 + 22 = [1]; 
(AB?) азу = 02 + 10 + 21 = [Ze]; 
(AB?) 4202 = 01 + 12 + 20 = [Js]; 


since the notation is reduced modulo three, and 0 = 3, 1 = 4, and 2 = 5. 

The first letter in the interaction has the coefficient of unity and the second 
letter has either unity or 2 = 3 — 1. This is done to obtain p + 1 independent 
and unique effects. For example, the three levels of the effect А?В are com- 
posed of the same treatments as are the levels of AB? = (AB?* = A?B, 
reduced modulo three. The notation is merely an extension of that used for 
the 2^ series where the partitioning of the treatment sum of squares is in 
units of (p — 1) 2 2 — 1 — single degrees of freedom. For the 3” series the 
partitioning of the treatment sum of squares is in groups of 3 — 1 — 2 de- 
grees of freedom. This system may be extended for p — 5, 7, 11, 13, etc. 
[173, 175, 177]. 

As in the 2" series the total for treatment ij from a3 X 3 factorial may 
be obtained from the formula, 


Em (4): + (B); + OB isi + (АВ?) _ ыы (VII-28) 


X. 
Which involves levels of the main effects and interactions. 


Example VII-3. Mahalanobis [203] presents a numerical example of three rice 
Varieties in combination with no inorganic fertilizer and two inorganic fertilizers, 
cre and ammonium sulphate at 30.4 pounds per acre. 


A randomized complete block design with six replicates of the nine 1/93 acre = 12 
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X 39’ plots was used. The yields of the plots are given in chattaks (Indian unit of 
measure) per plot (table VII-6). 


TABLE VII-6. Yield of grain from paddy grown in 39' X 12' plots (1/93 acre) for 9 
treatments 


Treatment 
Fertilizer 


Variety 


Ammophos 
Ammonium sulphate 


Red Aus | 
Control 


Ammophos 
Kashiphul Ammonium sulphate 
Control 


Ammophos 
Dudkalma Ammonium sulphate 
Control 


Total 


Ammophos = 8, 
Ammonium sulphate = a) 


Control (no fertilizer) = a 


Replicate 
Treatment 


fertilizers 
varieties 
AxB 


Before computin, : 
cate, treatment, and Pone of variance table, it is necessary to obtain the repli- 
totals. The sums of Squares ie B and to construct a table of variety and fertilizer 
eatments, replicates, and total are computed in the^ 
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manner set forth for the randomized complete block design (see Chapter V). The fer- 
tilizer (the a factor) sum of squares is obtained from formula (VII-25) as 


1933? + 1783? + 1674? _ 5390? _ Б 
E Omer 1878.9 with 2df. 
The variety sum of squares (formula (VII-26)) is 


21362 + 1484? + 1770? _ 5390? _ 1867.7 with 2d 
3(6) 54 11867.7 wi lf. 


The variety X fertilizer interaction, А X B, is usually obtained by subtracting the 
above two sums of squares from the treatment sum of squares, 17459.8 — 1878.9 
— 11867.7 — 3113.2 with 4 degrees of freedom. This sum of squares may be computed 
directly from the sum of squares for the two components AB and AB*; thus: 


2 2 2 


а 1 {ав - 609 + 698)? + (705 + 708 + 520)? 
+ (730 + 466 + 456)? + (698 + 466 + 705)? + (498 + 708 + 456)? 


5390)? 
+ (730 + 520 + — = 26300" 


= 540201.0 + 539515.9 — 2(538001.9) 
= 3713.1 with 2 + 2 = 4 degrees of freedom. 
d to be estimates of the same quantity, 


Since the two sums of squares are considere 


the results are pooled in the analysis of variance table. г e 
Making use of the F test it is seen that the treatment mean square 18 significantly 


larger than the error mean square, F — 2182.5/567.84 = 3.84 > Fos(8,40df). Par- 
titioning of the treatment sum of squares into its component parts indicates that the 
variation among varieties accounts for the major portion of the treatment sum of 
Squares, 
The standard error of a variety or fertilizer mean is. 4/561.84/6(3) — 5.62. The 
Standard error of the corresponding total is 4/567.84(6)(3) = 101.10. The correspond- 
ing standard errors of a difference are the above standard errors multiplied by 4/2. 
The coefficient of variation equals 4/561.84/99.8 = 24 per cent. The confidence 
intervals are computed as described in section II-1.1.4. , з 
The analysis of the above factorial has not been completed. We might wish to 
Ow the reaction of the two kinds of fertilizers with varieties, the comparison between 
the two kinds of fertilizers, a; and аз, the contrast of the two nitrogen fertilizers with 
the check, and the interaction of this contrast with varieties. The sums and totals 
Decessary to obtain the sums of squares for the above contrasts are presented in 
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table VII-7. The first table of totals consists of computing the quantity a» — a; for 
each replicate and for each variety. The first figure is —56 — 112 — 168, the second 


figure is 12 — 128 — 116, etc. The totals are summed algebraically to obtain the 
variety and replicate totals. The second table of totals is computed as аз + а — 2a». 


The first value is 112 + 168 — 2(106) = 68, etc. The variety and replicate totals are 
obtained. The differences between fertilizer totals are equal to the totals of these 


TABLE VII-7. Comparisons among fertilizers 


Fertilizers 

Ко ёт. 

еы т 
Ахв 

(a, - a) x var. 

(a, ta- 2а) x var. 
Error 

Ver. x rep. 

Fert. x rep. 


(a, - a) x var. x rep. 9175.0 


5578.3 


(a, + 8, - 280) x ver. x rep. 


quantities i 

SIGNI SES 88 thus, 1933 — 1783 = 150, and 1933 + 1783 — 2(1674) 

Squares for fertilizers: es of Squares of these two quantities equals the sum of 
> aus, 10/0 + 1)(6)(3) + 3682/6(3)(1 + 1 + 4) = 625.0 
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-+ 1253.9 = 1878.9. Likewise, the totals for varieties may be used to compute the 
interaction of the two contrasts with varieties, 


22? +32? +96? 150? , 42? + (—76)? + 4022 _ 368° 
6(1 + 1) 36 60 +1+4 108 


= 268.7 + 3444.5 = 3713.2. 


Asa further investigation, it was decided to compare the interaction of the various 
е four mean squares are not different, although there is a 


contrasts with replicates. Th 
ean squares are somewhat less 


slight indication that the main effects by replicates m 
variable than the interaction by replicate mean square, А X B X replicates. However, 


this may be a fortuitous event. 
The nitrogen fertilizers yield higher than no fertilizer, although not significantly 
higher, and the interaction of this contrast. with varieties is close to significance at the 


5 per cent level; thus: 
F = 1122.25/516.84 = 3.03 < Fos(2, 40df) = 3.23. 
The interaction of fertilizer versus no fertilizer with variety is presented graphically 
in figure VII-5. 
1450 


Total yields 
a 
© 


bi bz 
Variety 
n of the contrast az + a1 — 2a with variety. 


bo 
Figure VII-5. Interactio! 


sirable to partition the 2 degrees of freedom for 


y be de 1 
Bz, Ag, and Во) as described 


In some experiments it ma 
TE d quadratic effects (Az, 


A and (or) for B into linear an 
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by Snedecor [273, Ch. 15]. The following table of plus and minus coefficients for single 
degree of freedom comparisons in a 3 X 3 factorial has been prepared for this purpose: 


Treatment totals 


Beet. twee ese Test Tess IX TX xu | шуна 
n» = ede x redes |e + бг 
As ie me dose ea eese Е aL | aes 
вё ee SEE IS йу ses WS E LE 
B, ЕКЕ EC Ун EE ses: 
АХВ, | + Les 0 0 (res est 4r 
Ga || TRIS t» || vero feel ЕЕ d vet БЫ 
Ag X B, — 0 + +2 0 —2 — 0 + 12r 
Ass is red eese Е ЕЕЕ E У E || Ser 


VII-5.2 THE 3* FACTORIAL 


It may be desired to test three factors a, b, and c in all combinations of 
three levels per factor. The resulting twenty-seven treatments may be in- 
cluded in a randomized complete block or some other design. The treatment 
combinations are 


000 001 002 010 011 012 020 021 022 
100 101 102 110 11 112 120 121 122 
200 201 202 210 211 212 220 221 222 


The levels of the effects in terms of the treatment combinations are 


(A), = 000 + 001 + 002 + 010 + 011 + 012 + 020 + 021 + 022; 
(A), = 100 + 101 + 102 + 110 + 111 + 112 + 120 + 121 + 122; 
(A). = 200 + 201 + 202 + 210 + 2114 212 + 220 + 221 + 222; 
(B), = 000 + 001 + 002 + 100 + 101 + 102 + 200 + 201 + 202; 
(B), = 010 + 011 + 012 + 110 + 111 + 112 + 210 + 211 + 212; 
(B), = 020 + 021 + 022 + 120 + 121 + 122 + 220 + 221 + 222; 
(AB), = 000 + 001 + 002 + 120 + 121 + 122 + 210 + 211 + 212; 
(AB); = 010 + 011 + 012 + 100 + 101 + 102 + 220 + 221 + 222; 
(АВ)» = 020 + 021 + 022 + 110 + 111 + 112 + 200 + 201 + 202; 
(AB?), = 000 + 001 + 002 + 110 + 111 + 112 + 220 + 221 + 222; 
(АВ?), = 020 + 021 + 022 + 100 + 101 + 102 + 210 + 211 + 212; 
(АВ?) = 010 + 011 + 012 + 120 + 121 + 122 + 200 + 201 + 202; 
(C), = 000 + 010 + 020 + 100 + 110 + 120 + 200 + 210 + 220; 
(C), = 001 + 011 + 021 + 101 + 111 + 121 + 201 + 211 + 221; 
(C). = 002 + 012 + 022 + 102 + 112 + 122 + 202 + 212 + 222; 
(AC), = 000 + 010 + 020 + 102 + 112 + 122 + 201 + 211 + 221; 
(AC); = 001 + 011 + 021 + 100 + 110 + 120 + 202 + 212 + 222; 
(АС)» = 002 + 012 + 022 + 101 + 111 + 121 + 200 + 210 + 220; 
(AC?) = 000 + 010 + 020 + 101 + 111 + 121 + 202 + 212 + 222; 
(AC), = 002 + 012 + 022 + 100 + 110 + 120 + 201 + 211 + 221; 
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(АС), = 001 + 011 + 021 + 102 + 112 + 122 + 200 + 210 + 220; 
(BC), = 000 + 012 + 021 + 100 + 112 + 121 + 200 + 212 + 221; 
(BC), = 001 + 010 + 022 + 101 + 110 + 122 + 201 + 210 + 222; 
(BC). = 002 + 011 + 020 + 102 + 111 + 120 + 202 + 211 + 220; 
(BC?), = 000 + 011 + 022 + 100 + 111 + 122 + 200 + 211 + 222; 
(BC?) = 002 + 010 + 021 + 102 + 110 + 121 + 202 + 210 + 221; 
(BC?) = 001 + 012 + 020 + 101 + 112 + 120 + 201 + 212 + 220; 
(ABC), = 000 + 012 + 021 + 102 + 111 + 120 + 201 + 210 + 222; 
(АВС), = 001 + 010 + 022 + 100 + 112 + 121 + 202 + 211 + 220; 
(ABC). = 002 + 011 + 020 + 101 + 110 + 122 + 200 + 212 + 221; 
(АВС?), = 000 + 011 + 022 + 101 + 112 + 120 + 202 + 210 + 221; 
(ABC?), = 002 + 010 + 021 + 100 + 111 + 122 + 201 + 212 + 220; 
(ABC2)s = 001 + 012 + 020 + 102 + 110 + 121 + 200 + 211 + 222; 
(AB:C), = 000 + 011 + 022 + 102 + 110 + 121 + 201 + 212 + 220; 
(ABC), = 001 + 012 + 020 + 100 + 111 + 122 + 202 + 210 + 221; 
(АВ?С)» = 002 + 010 + 021 + 101 + 112 + 120 + 200 + 211 + 222; 
(AB*C:), = 000 + 012 + 021 + 101 + 110 + 122 + 202 + 211 + 220; 
(AB?C2), = 002 + 011 + 020 + 100 + 112 + 121 + 201 + 210 + 222; 
(AB:C?), = 001 + 010 + 022 + 102 + 11 + 120 4- 200 + 212 + 221. 


E 


The comparisons among three levels of each of the p? + p + 1 = thirteen 
effects yield thirteen sums of squares, each with 2 degrees of freedom. The 
following breakdown of the 26 treatment degrees of freedom is possible: 


ee Бика — —— 


Source of variation Degrees of freedom 


Treatments 26 


Main effects 
A 2 
B 2 
Cc 2 
2-factor interactions Я 
AX Bape 24 
2 
AO Hr 
2 
Bx све [+ 
3-factor interaction 
ABC -JJ 2 
ABC? = П 
AXBXCyapec =Л 2{8 
ABC? = IJ 2 


Lir ox le ee 


where the JJ, etc. notation is that used by Yates [324] and others [e.g., 60]. 
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In a factorial experiment with no confounding of effects the two-factor 
and three-factor interactions usually are not partitioned into separate parts. 
The partitioning is useful in the construction and for the analysis of incomplete 
block experiments. The sums of squares are obtained as before. For example, 
the sum of squares for the AB?C effect from r replicates is 


(ABC? + CONSO (CERO M = (VIL-30) 


9r yields make up each total for the level of an effect and 27r yields make 
up the grand total, X..... 


VII-5.3 GENERALIZED INTERACTION IN THE 3" SERIES 


The concept of a generalized interaction carries over to the 3” series or 
to the p* series in general. Several examples of interactions, reduced modulo 
three, with the first factor at the first power are presented below: 


ae X АВ? = AB? = AtB: = АВ = A; 
AB x A?B* = A:B5 = В. 
{а X АВ = АВ = A‘B? = АВ?; 
А X AB = ARB = B = В. 
A X B = AB; 
M x В? = AR. 
AB х CD = ABCD; 
у x CD! = ABCD. 
АВ? x CD? = AB'CD'; 

n х CD: = AB'C:D! = ABCD. 
ABC? x CD? = АВС? = АВР; 
vr x C'D* = ABC:D* = ABCD. 
ABCD x ВС? = ABCD = А; 

т X B'CID! = ABCD, 
ABCD x CEFG = ABC'DEFG; 
ndn x CE T:G: = ABDEFG?, 


VII-6 Other Factorials of the р" Series 


Application of the ideas presented thus far enables the experimenter to 
set up 25, 2°, 27, etc. and 3*, 35, etc. factorial combinations. In lieu of using large 
numbers of treatments, the optimum level for some factors may be determined 
and then a p? or p? factorial used for the remaining factors. Also, some form 
of fractional replication (Chapter IX) may suffice. 

VII-6.1 THE 4 X 4 FACTORIAL 


The 4? factorial may be considered either as a 2 or as a 4 X 4. The former 
consideration would be useful in the construction and analysis of an incom- 
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plete block design. The latter consideration would be necessary if two factors, 
each at four levels, were tested in all combinations. For example, four methods 
of mixing cakes and four types of baking powder might be tried in all com- 
binations. The breakdown of the 15 treatment degrees of freedom from r 


replicates follows: 


Source of variation Degrees of freedom 
Replicates r—1 
Treatments 15 

Methods of mixing 3 

Types of baking powder 3 

Methods X types 9 
Error 15(r — 1) 
16г — 1 


Total 
The interaction sum of squares is obtained from the 4 X 4 table of treatment 
totals. 

In another type of 4” factorial experiment [273, Ch. 15] the experimenter 
may be interested in partitioning the main effect degrees of freedom into 
linear, quadratic, and cubic effects. In a like manner it is possible to obtain 
the linear X linear, linear X quadratic, quadratic X linear, linear X cubic, 
etc. comparisons by partitioning the interaction sum of squares into individual 


degrees of freedom. 


ҮП-6.2 THE 5 х5 FACTORIAL 


Since five is a prime number, the 5^ 
manner as the 2" and 3^ series. The breakd : 
a 5? factorial experiment with r replicates of a randomized с 
design is 
c Source of variation Degrees of freedom ' 


Replicate r Е 1 
"Treatment Е 


series may be handled in the same 
own of the degrees of freedom for 
omplete block 


16 


w 
Daaa aa 


ADA 24(r — 1) 


GEM зуп ошак — uuo 
25p— 1 

For each level of an effect, 5r yields are available. Therefore, the sum of 

2 


Squares for the A effect is 7 
(Aye + (Aje И) + Alt + Ae 5. (VII-31) 
5r 
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Vil-6.3 GENERAL CASE 


In general the A X B interaction in a p X p factorial may be partitioned 
into p — 1 effects, each with p — 1 degrees of freedom; thus, 


p1 
AX B=AB+ AB +... + ABP = SAB (VII-32) 
uel 


The A X B X C interaction in a p X p X p factorial may be partitioned 
into (p — 1)? effects, each with (p — 1) degrees of freedom; thus: 


p-1 p-1 
AXxXBxG- »» >; AB*C*. (VII-33) 

The treatments making up a level of an effect are determined from the 
formula i + uj for a p? factorial and from the formula i + uj + vh for the 
р? factorial. The above formula may be extended for p equal to any prime 
number. 

The p* — 1 degrees of freedom for a p? factorial may be partitioned into 
p +1 effects, each with p — 1 degrees of freedom. Likewise, the p-—1 
degrees of freedom for a р? factorial may be partitioned into p? + р + 1 
effects, each with p — 1 degrees of freedom. In general, the р" — 1 degrees 
of freedom for a р" factorial may be partitioned into р"! + P+- +p 
-+ leffects each with p — 1 degrees of freedom. 


VII-7 Experimental Designs For Factorial Experiments 
ҮП-71 EXPERIMENTS WITH REPETITION OF THE р" TREATMENTS 


The three experimental designs allowing for a repetition of a set of treat- 
ments discussed thus far are the completely randomized design, the ran- 
domized complete block design, and the latin square design. The first two 
designs may be used for any number of treatments, while the latin square de- 
sign is practical only for the 2?, 2°, and 3? factorials. Larger factorials in a latin 
square design require an excessive amount of repetition of the set of treat- 
ments. 

Ordinarily the 5°, 3%, 25, 2°, 4%, 34, 5%, etc., and sometimes the 2* and 4? 
factorial experiments, are not designed as randomized complete block experi- 
ments, since it is difficult, and often impossible, to obtain enough homogeneous 
material. Therefore, it is necessary to stratify a complete replicate into - 
homogeneous subgroups and to use confounding and incomplete block designs 
to control variation within a complete replicate (see later chapters). 


VII-7.2 EXPERIMENTS WITHOUT REPETITION OF THE p^ TREATMENTS 


d ай four-factor (or second- and third-order) interactions may 
ie E а x some factorial experiments. If this is true, these inter- 
with pooled with the experimental error. Knowledge of this may be 

ing up an experiment. For example, suppose it is desired to ob- 
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serve the effects of the fertilizer treatments—two levels of the four factors, 
nitrogen (n), potash (Ё), phosphorus (p), and lime (J), in all combinations— 
on the composition of the vegetation growing in peat and bog lands. The six- 
teen fertilizer treatments could be applied in a specified year and the effect 
on kind and amount of. vegetation observed over a period of years. Since it is 
difficult to transport the fertilizer to the experimental area, it may be decided 
lo use one replicate of the sixteen treatments. Several checks could be included, 
and the variation among the checks and the interactions for three and four 
factors could be used as the experimental error. The breakdown of the degrees 
of freedom in the analysis of variance for the sixteen treatments and four 
additional checks placed at random over the experimental area is 


А 
Source of variation Degrees of freedom | Source of variation Degrees of freedom 
ү 


N 1 L 1 
NL 1 

К, т KL 1 

NK 1 PL 1 

Р T Among 5 checks 4 9 

NP 1 3- and 4-factor interactions 5 

KP 1 Total 19 


For such a design, information is available on all main effects and two-factor 
(or first-order) interactions; 9 degrees of freedom are available for the error 
sum of squares. The standard error of a main effect is 4/E./4. As a further 
precaution, check plots could be placed between each plot to control (by co- 
Variance) some of the variation over the experimental area. With a covariance 
analysis, 8 degrees of freedom are available for the error sum of squares. 
Fisher [126, sec. 41] suggests this type of design for a 2° factorial experi- 
ment with the three-, four-, five-, and six-factor interactions to be used as 
error. In a design of this type the three-factor interactions could be excluded 
from the error, since there are sufficient degrees of freedom for the error term 
from the four-, five-, and six-factor interactions. If the interactions used as 
error are not actually zero in the population, a positive bias results in the 
estimation of the error variance [175, sec. 14.6]. If the true interaction effects 
are small relative to the error variance, little harm results from the above 


Procedure, 


CHAPTER VIII 


Other Factorial Experiments 


VIH-l The p X q X k --- Series of Factorials 


In the preceding chapter the discussion is limited to factorials of the p” 
type where the n factors are each at p levels. It may be undesirable to have 
all factors at the same level and, indeed, it is not necessary. The р" series of 
factorials are discussed separately from other factorials in order to introduce 
the modulo notation, which is suitable for the present usage of prime numbers 
or powers of prime numbers but is not suitable for nonprime numbers. The 
notational system of the preceding chapter is used wherever applicable in 
the remaining chapters. 

The main effects in a p X q and in a p X q X kare estimated in the same 
manner as described previously [324]. The interaction sums of squares are 
obtained from formulae (VII-1) and (VII-2), respectively; the method here 
is the same as described by Snedecor [273] for two- and three-way classifica- 
tions. The breakdown of the degrees of freedom is illustrated below for a 
p X q factorial and a p X q X k factorial. If there are p levels of one factor, 
say a, and q of another, say b, the breakdown of the degrees of freedom for 
an experiment in which the pq treatments are arranged in r replicates of a 
randomized complete block design is 


Source of variation Degrees of freedom 


Replicate r—1 
"Treatment ра = 1 
А p-1 
B q- 1 
АХВ (p - (9-1) 
Error (с — 1)(ра — D 
Replicate X A (r-1(p-1 
Replicate x B t- D(q— 1) 
Replicate X A X B (- 0р - (9 1) 


-O АННЕ _ 
The error sum of squares for all two-factor factorials may be partitioned in 
a ja manner to that described in the above experiment. 

s a further illustration of factorial experiments, suppose that three factors, 


а, b, and c, are at p, q, and k levels, respectively, and are to be tested in all 
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combinations. Suppose that the design is r replicates of a randomized complete 
block. Then, the breakdown of the total degrees of freedom in the analysis 


of variance is 


Source of variation Degrees of freedom 


Replicate = R r-l 
Treatment = T рак — 1 E 
А p-1 
B а= 1 
АХВ (р — (9 — 1) 
C k—1 
АХС (р —1)(К – 1) 
BXxXC (q — 1)(k — 1) 
AXBXC (p - D(a — D(k — 1) 
RXT (к — 1) (рск — 1) 


Total грак — 1 


The p — 1 degrees of freedom may be partitioned into p — 1 individual 
degrees of freedom if the comparisons are meaningful. If the factor a is applied 
in p different amounts, it may be desirable to estimate the linear, quadratic, 
and perhaps other responses. Also, the b factor may be partitioned into con- 
trasts with single degrees of freedom if the contrasts represent meaningful, 
and not fortuitous, comparisons. The interaction of the various contrasts 
may also be obtained (example VII-3). 

If s? = the error mean square, the standard error for a me 
factor a from a p X q X k factorial experiment composed of r replicates is 


V/s*/rqk. Likewise, the corresponding standard errors for the mean level of 
the factors b and c are ^/s?/rpk and 4/s*/rpq, respectively. The standard 
error of a mean for a particular treatment, say -ijn is Sh 


an level of the 


VHI-L THE 2" x 3: SERIES 

A useful type of factorial arrangement is 
levels and n factors each at two levels. Thus, for n = 1 = s, we havea2 X3 
factorial composed of two levels of factor a and three levels of factor b in all 
combinations. For n = 2, = 1, we have a 2 x 2 X 3 factorial composed of 
twelve treatments. For п = 1,5 = 2, we havea2 X 3 X 3 factorial composed 


of eighteen treatments. These are some of the more common factorials of 


the 2" x 3: type. The randomized complete block design is suitable for these 


factorials in a large number of cases. Occasionally, larger combinations аге used, 
in which case it may be desirable to use one of the incomplete block designs 
discussed in the following chapters in order to exclude some of the hetero- 
geneity within the complete block. The type of material and heterogeneity 
of the experimental area should be considered in selecting a design. 


[261] presents the data for three green manure crops, 
rol, and v2 = Sword Веап, grown with (=m) and 


to have s factors each at three 


Example VIII-1. Sauvage 
% = Sunnhemp, vı = Woolly Ру 
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Since the levels of factor a are compared in three replicates of a rando- 
mized complete block design, error (a) (the replicate X A interaction mean 
square) is the appropriate error for comparing the A effect. Likewise, error 
(b), which is a composite of the interaction sums of squares for replicate X B 
and replicate X A X B, is the appropriate error mean square for testing the 
А X B interaction, and for testing the B effect in some instances [126, sec. 
65]. The replicate X B and replicate X A X B effects are confounded with 
each other and are not separable as such [108]. Even though the calculation 
of these two interactions is possible arithmetically, both interactions are 
estimates of the same quantity and should be pooled together in the error 
(b) sum of squares. This quantity may appropriately be called a split-plot 
X replicate (B X replicate) sum of squares within whole plots. 

The above discussion brings up the question—how does this incomplete 
block design, the split plot design, compare with a randomized complete block 
design with regard to the contrasts on the main effects and the interactions? In 
making this comparison, several points need to be considered. The B and А 
X B effects are usually estimated more accurately than the whole plot treat- 
ments (the А effects), since the variation within incomplete blocks is usually 
smaller than among the incomplete blocks. Also, the number of degrees of 
freedom available for whole plot comparisons is smaller than for split plot 
comparisons. Since the average standard error of a difference is the same for 
both the incomplete block and randomized complete block designs, there is 
no over-all gain in accuracy due to using the incomplete block design. The 
increased accuracy on the B and A X B effects is obtained by sacrificing 
accuracy on the А effects. Also, both sets of comparisons have fewer degrees 
of freedom available for the error variances, which makes the randomized 
complete block design somewhat superior for all comparisons. 1 

Disregarding the difference in number of degrees of freedom, the efficiency 
of the split plot design relative to the randomized complete block design on the 
B and А X B comparisons is 


(p — 1) (p — D(r —D)E;-- [( — D + (y - Dp — D + p Dg - D1E« 
(par — r)Es 
- P- DE. + pl - 1)Es E/ 
(q-DE ~ Ey 


(X-1) 


where p — number of whole plot treatments, q — number of split plot treat- 
ments, r — number of replicates, E, = error (a) mean square, and E; = error 
(5) mean square. On the other hand, the efficiency on the A effects or the 
whole plot comp 


E/E arisons would be decreased, the formula in this case being 


Е; а i 1 3 
sample X-1. During the spring of 1944 a seed-germination test on forty-niD® 


——————M— 
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varieties of guayule was conducted in a greenhouse. The following four seed treat- 
ments were applied to lots of seed from the varieties: 


(i) 1943 collected seed was threshed but not treated with sodium hypochlorite 


= bo; 
(ii) 1943 collected seed was neither threshed nor treated with sodium hypo- 
chlorite = bi; 


(iii) 1942 collected seed was not threshed but treated with sodium hypochlorite 
= be; 

(iv) 1943 collected seed was not threshed but treated with sodium hypochlorite 
= b; 


Since the comparisons on seed treatments and on the interaction of varieties and seed 
treatments were considered to be more important than those on variety mean germi- 
nations, a split plot design with varieties as whole plots was used. Six replicates of a 
randomized complete block design were used for the varieties. The whole plot was a 
greenhouse flat subdivided into four sections (the split plots). One hundred seeds were 
planted in each split plot. The data recorded were number of plants emerged. 

For an illustrative example, eight varieties, ao, а, ***, a; were selected from the 
. first three replicates of this planting. The number of plants that emerged from a 

hundred seeds for the variety and seed treatments are recorded in table X-2 for each 

split plot in the three replicates. 

The necessary totals for the analysis of variance are given in table X-2. The 
correction term is equal to 


X. 2429? 
- = 61,458.76 = CT(1df). 
total no. of plots 8(3)(4) Sn 


The total sum of squares is equal to 
122 + 102 + ... + 11? + 15? — CT = 98,195 — CT = 36,736.24(95 df). 


The replicate sum of squares is equal to 


825° + 781° + 823" — OT = ge sa (2df). 
84) = 32 


The Variety sum of squares is equal to 


296? + 322! + --- + 333° _ Op = 763.16(74). 
43) = 12 


The variety by replicate or error (а) sum of squares is equal to 
97? + 89? +... +101? + 110? _ зв єв — 763.16 — СТ = 1377.25(14d)). 
4 
The seed-treatment sum of squares is equal to 


мер Баа: — СТ = 30,774 28(3df). 
3(8) = 


The treatment by variety sum of squares is equal to 
199° + 190? + ... + 30° + 36? _ ср — 763.16 — 30,774.28 = 2620.13(21df). 
3 
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i in si i i lete block design. The 

i t (=no) nitrogen in six replicates of a randomized comp! 
D Ge VOS in pounds per experimental unit (1/36 acre harvested from each 


plot after border was removed) are for green weights of the crops.' 


TABLE VIII-1. Yield data in pounds of green matter per 1/36 acre plot 


278 323 288 
287 D 305 
91 170 107 


156 157 15 
194 195 285 
19% 18% 292 


1200 3 llo 11 1552 


Sunnhemp Woolly Pyrol Sword Bean Total 


п = 
lo = none 


D; = 150 pounds of (11H, ) 80, /А 


Total 


Source of variation 


Block 
Treatment 


V = variety 
N = nitrogen 
ух! 


Еггог 


Total 
Correction for mean 


Total uncorrected 


The treatment, replicate, total, and error sums of squares in table VIII-1 are 
obtained in the same manner as for a randomized complete block design. The remain- 
ing sums of squares are computed as follows: 


Variety sum of squares (2df): 
3593? + 1511? + 2556? _ 7660? 
2X6=12 36 


1A more realistic measure of the value of the crops with and without nitrogen might be 
obtained by using dry weights instead of green weights of the crop. The heavier yields 


might be due to a higher moisture content; if so, the estimated additional nutrients and 
organic matter added to the soil are too large. 


= 180614. 
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Nitrogen sum of squares (1df): 
3571? + 4089? — 7660? (3571 — 4089)? 
3X6=18 36 3х 61+ (—1)3) 


7453. 


Variely X nilrogen sum of squares (5 — 2 — 1 = 240): 
190073 — 180614 — 7453 — 2006. 


In this experiment, Woolly Pyrol yielded 42 per cent and Sword Bean yielded 
71 per cent as much green weight per plot as obtained from Sunnhemp. The sd for 
a difference between variety totals is 4/. 1860.28(12)(3.52) = 525.9 (section 11-1.1.4). 
The varieties differ from each other with respect to total yield of green weight. 

There is an indication that the effect of adding ammonium sulphate at the rate 
of 150 pounds per acre is to increase green weight. If it is assumed that in no case 
should the addition of nitrogen decrease yields, then we make a one-tailed £ test of 
the difference; thus: 


_ _ 4089 = 3571 _ _ 9.99 > ho(25df) = 1.708. 
-//1860.28(18) (2) 


However, one should know something about the relative moisture content, dry weight, 
and cost of applying the fertilizer prior to recommending the use of ammonium sul- 
phate over no fertilizer. There is no evidence of a variety by nitrogen interaction; 
the results are presented graphically in figure VIIT-1. 

m 


1900 


1700 no 


Total yields (Ibs.) 
a 
S 


Woolly Sword Sunnhemp 
Pyrol Bean 


Figure VIII-1. Graphical presentation of treat- 
ment yields from table VIIH-1 
The standard errors are 


Treatment totals: 


Jer = /1860.28(6) = 105.6. 
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TABLE VIII-2. Sums, sums of squares and products, regressions, 
and analysis of variance of regressions for ni(Y) with 
no(X) by variety 
(9) (,) 
Sunnhemp (vo) 


zx? = 487075 Ze Y= 939394 2,8 = 611934 


m, = 16% EY, = 18% 
(207/6 = 178837.5 | zxx/6 = 536185.0 | (ху)®/6  боо%оо.1 


Ix? = 8237.5 T fy? = 11555.5 


b = 3209.0/8237.5 = 0.3896 
Woolly Pyrol (v) 
= 100636 XXY = 100194 ЖҮ? = 111251 


= 750 a а 
(хх)°/6 = 88816.7 | zxzx/6 = 9501.7 | (zx)2/6 = 101660.2 


Bele 11819. Ixy- 5172.5 ху? = 9570.8 
b = 5172.3/11819.5 = 0.4376 


Sword Bean (v,) 


ZX? = 251556 ZXY = 273109 Y? = 34558 
x Tike EY = 110 
(2х)2/6 = 218886.0 EXEY/6 = 269310.0 (zv)?/6 = 331350.0 
Zx = 12670.0 Ixy = 3799.0 Ty? = 123188.0 
b = 3799.0/12670.0 = 0.2998 


09. 
AT 3209.0 + 5172.3 + 3799.0 


= 0.37218 
8257.5 + 11819.5 + 12670.0 


Analysis of variance 
Source of variation 
Linear regressions Within varieties 


4652.6 
Average regression 3 | 
Among regressions xp 


119.39 
Deviations from individ: 
vidual regressions 29639.43 2469.95 


Total for ny within varieties m 
5290.1 > 
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Variely totals: 
A/sirq = 4/1860.28(6)(2) = 149.4. 


Nitrogen totals: 
J/srp = 4/1860.28(6)(3) = 183.0. 


The coefficient of variation is equal to s/z = 364/1860.28/7660 = 20 per cent. 

The above analysis has been carried through assuming homogeneity of error 
variances for the various comparisons and additivity of the data. For use in future 
research of this type the validity of these assumptions may be tested. First of all 
the validity of the assumption of additivity for nitrogen yields might be tested. Sup- 
pose that there is additivity between yields of no and nı within varieties and within 
replicates. Then, the value of the regression of nı yields, Хоа = У,, on no yields, 
Хов = Жы, should not differ from unity. If the observed regression differs significantly 
from В = 1, there is reason to suspect the assumption of additivity. To test the 
hypothesis that 8 = 1, the sums of squares and products are first computed (table 


450 


400 


350 


ХЕ). 


921.26 


о 50 100 150 200 250 300 350 
Yields, по 
Figure VIII-2. Yields of n; plotted against yields 
of n, within variety and replicate 


sum of cross products for т and no for the different repli- 
) + 278(287) + 246(306) + 331(282) + 323(412) 
3, sec. 12.5]. The test of the hypothesis that 


VIII-2). For example, the 
cates for Sunnhemp is 229(306 


+ 288(305) — 1695(1898)/6 = 3209.0 [273, sec. 
B=1 is ¢ = (37218 — D/4/2469.95/32,726.8 = 2.29 > быу = 2.18. Thus, we 


Would reject the hypothesis of unit slope for the average within-variety regression of 
т yields on по yields (see figure VIII-2). For a given variety the difference between 
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т and ло yields decreases as the yield level of the replicate increases. Thus, there is a 
replicate X nitrogen interaction, and the assumption of additivity is suspected. 

The procedure described in the first part of section II-4 provides a test of another 
sort of nonadditivity. The necessary totals for applying the procedure are given in 
table VIII-3. The difference Жоа — Жоо represents the linear response of nitrogen. 
Hence, the computation of the linear X linear interaction (formulae (II-11) and 
(V-3)) of these differences yields the linear X linear X linear interaction. The sums 


TABLE VIII-3. Totals and deviations for computing sums of squares for non-additivity 


NL. Lp 
I II III IV Yi Total | Deviation 


VI 


3593 86.7 

1511 -86.9 

2556 0.2 
1072 1200 1153 12 11 1552 


-34.1  -12.8 -20.6 27.6 27.4 12.5 


15 тїї 155 672 
072 1200 1155 ilo 11 1352 1660 


7h. -12.8 -20.6 27.6 27.4 12.5 


of squares for the 
Variances do not 


- Since the varieties differ wide] ld, one 
eee u i widely in yield, o 
replicates and varieties differ Widely in ушн et bias к. 
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mental data of this type are needed to investigate these possibilities. Investigations 
of the above nature on a number of experiments will yield empirical evidence of the 
type and extent of the nonadditivity. Then, from these results the experimenter will 
be able to determine whether the analysis should be of the form given in table VIII-4 


TABLE VIII-4. Partitioning of replicate X treatment sum of squares 


Source of variation 


Replicate x treatment 


46507 


Variety x replicate 25178 
Non-additivity 10.4 
Residual 25167.6 

Nitrogen x replicate 4179 
Non-additivity 4035.1 
Residual 143.9 

Variety x nitrogen x replicate 17150 
Non-additivity i ао 


Residual 


or whether some of the components in the analysis of variance may be pooled to obtain 
the estimated error variance for a given contrast. 


УШ-1.2 OTHER FACTORIALS 


In addition to factorials of the 2" X 3' type, we may use 2" X 4", 2^ X ЕН) 
2^ X 6*, 2» X 74, etc., 3^ X 4*, 3" X 5", 3^ X 6', etc., and many other fac- 
torials. The particular number of levels for the various factors depends upon 
the nature of the material being tested and the objectives of the experiment. 
However, with larger factorials, it may be necessary and (or) desirable to 
use confounding and incomplete block designs to control some of the vari- 
ability over the experimental area. Methods and designs for doing so are 


discussed in the following chapters. 


VIII-2. Additional Treatments in Factorial Experiments 


VII-24 FACTORIAL ARRANGEMENT OF TREATMENTS PLUS OTHER 
TMENTS 


It is not necessary that all treatments in a factorial experiment be those 
Constituting a particular factorial arrangement. Other treatments may be 
added if they are of interest to the experimenter. For example, one might 
include a zero level of all factors plus a р" factorial for which the lowest 
level was not zero (problem VII-1). Or, one might include two factorial 
arrangements in one experiment; for example, a 2? and a 2? factorial could be 
arranged in a randomized complete block or latin square design consisting of 
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4 + 8 = 12 treatments. The breakdown of the total degrees of freedom for 
such an arrangement in a latin square design follows:! 


Source of variation df Source of variation df 
Total 143 1 
р 1 
Column 11 CD 1 
Row 11 0 T 
Treatment 11 2! set «E 
A 1 CE 1 
2: Zr 1 DE 1 
AB 1 CDE 1 
2 set vs 2 set 1 Еггог 110 
SSS SS SS Xon nO —— 


If two sets of treatments are included in the e 
a factorial arrangement but the other set need not be. For example, suppose 
that we wish to study the oxygen intake of dairy workers carrying milk pails 
of two different weights, ш, over three different distances, d, down a straight 
path, and pouring the milk into containers at three different heights, h. In 
addition, it is desired to determine the effect of walking down a straight aisle, 
an aisle with a 90-degree turn, and an aisle with steps in it for the combination 
dywoh; and for a longer distance than used for the 2 X 3? factorial treatments. 
Denoting the type of aisle as the A effect, the breakdown of the total degrees 
of freedom for the3 X 2 X 3 + 3 = 21 treatments arranged in two replicates 
of a randomized complete block design is 


xperiment, one set may be 


Source of variation df Source of variation df 
Replicate 1 A a : 
Treatment 20 x 
т b 2 WXxH 2 
À vs factorial set 1 DXWXxH 4 
D 2 Error 20 
y 1 Total 41 
Dxw 


to 


Correction for mean 1 


u р Some experiments, it may be necessary to use sets of factorial arrange- 
eee E erer to keep the treatment combinations within the limit of prac- 
к nether 8 of experiments, it may be inadvisable to include 


of sugar in an i lysis for a 3 X 3 factorial of methods of mixing and 
temperatures X 3 metho, ad with 


a 3° factorial composed of 3 kinds of fat X 3 
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baking experiment, but we know that certain combinations will give failures. 
"Therefore, we make use of the following combinations (marked X) in our 
experiment, since we are reasonably certain that total failures will not be 
produced with these treatments: 


So E 8: 5; 
b; — — x x 
bi — X x = 


If the above seven treatments are compared in a 7 X 7 latin square design, one 
possible breakdown of the total degrees of freedom is (note that treatments 11, 
12, 21, and 22 form a 2 X 2 factorial) 


Source of variation df Source of variation df 
Row 6 5 1 
Column 6 22 set {в 1 
Treatment 6 SXB 1 
Among 20, 30, 02 2 Error 30 
2? set vs others 1 mah ee ean Siac ДОЙ 


= O ee 


О? course the nature of the particular combinations chosen determines the 
treatment comparisons made. With the above set, some information isobtained 
on the interaction of levels of sugar by levels of baking powder. If it is desired 
to estimate the maximum cake volume for combinations of the two factors, 
holding all other factors affecting cake volume constant, one of the methods 
mentioned in section VII-3 or the one suggested by Box and Hunter [35, 36] 
may be used. In selecting specific combinations, the experimenter should make 
certain that the number of replicates (including hidden replicates) is sufficient 


or his purposes. 


УШ-2.2 ADDITIONAL COMBINATIONS INCLUDED IN THE FACTORIAL 
XPERIMENT 

In some situations the experimenter desires to have 
9n the check treatment, the lowest level of all factors, or on one of the other 
Combinations of a factorial arrangement. The decision to include extra repli- 
Cates of a treatment may be made because (i) it is known or suspected that 
this treatment is more (or less) variable than the other treatments or (ii) 
Nore information js desired on this treatment than on the others. Several 


additional replication 
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problems are immediately encountered in the analysis of a factorial experi- 
ment of the above type. Some of these are 


(i) the utilization of all experimental units in obtaining estimates of main effects 
and interactions, 
(ii) the orthogonality of contrasts, 
(iii) alternative estimates of effects, f 
(iv) the treatments to be repeated and the method of estimation resulting in 
orthogonal contrasts, 


(v) the computation of variances for various estimates of the effects, and 
(vi) the'unbiasedness of estimated effects. 


The above points should be considered prior to conducting an experiment 
including extra checks or treatments in a factorial experiment. 


VII-23 PROPORTIONAL FACTORIALS 


If additional treatments are desired in a р" factorial, it is possible to select 
the combinations in such a way that the resulting contrasts form an orthogonal 
Set of comparisons. This condition, in itself, may have little merit [273, sec. 
15.4] unless the estimates have other desirable properties, such as minimal 
variance and unbiasedness. One method of selecting treatments in factorial 
experiments is to duplicate (or triplicate) the levels of one factor at one or 
more levels of the second factor. For example, the treatments 00, 00, 10, 10, 
01, and 11 would form a set of treatments composed of duplicates of as and а 
at the zero level of factor b. А set of treatments selected in this manner is 
designated as a proportional factorial, since the numbers of replicates are pro- 
portional in the two-way (or higher) classification. 

In the 3? factorial the three levels of factor a could be duplicated (or 
triplicated) for one of the levels of a second factor, say bo, resulting in the 
twelve treatments: 00, 00, 10, 10, 20, 20, 01, 11, 21, 02, 12, and 22. Also, the 
level of factor а could be duplicated at two levels of factor b, say by and b, 
resulting in fifteen treatment combinations. This method of constructing 

Proportional factorials rn. 


The problems enumerated in section VIII-2.2 al: 


so apply to proportional 
factorials, 


Although it is possible to select a set of orthogonal contrasts, some 


situations. Prior to con- 
a proportional factorial, it is desirable to obtain 
Ty answers to the problems enumerated above. 


THE INCLUSION OF ADDITIONAL TREATMENTS 


r is fairly certain that the additional checks or other 
varia 
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factorial in five replicates occupies twenty experimental units, and a 2? 
factorial and one additional check in four replicates require the same number 
of plots. The additional replication on the remaining three treatments is 
well worth considering. Likewise, a 2° factorial set of treatments and two 
additional checks in three replicates require thirty plots, while 2° treatments 
in four replicates require only thirty-two plots. Also, the replicate size is 
larger when additional treatments are included. Such considerations as these 
make it imperative to “think twice” before duplicating (or triplicating) 
certain treatments in factorial experiments at the expense of reducing the 
replication on other treatments. 

The main use of proportional factorials is for situations in which more 
information is desired for some factors than for others. In the proportional 
factorial scheme, additional replication is obtained on some combinations, 
the experimental error variance for the repeated treatments may be separately 
estimated, and all available material may be utilized in estimating main effects 
and interactions, In the absence of interaction, unbiased estimates of the main 
effects are possible. The particular form of the estimate used is the one giving 
minimal variance [122]. If the error mean square is used for testing the main 
effect mean squares, no trouble results. However, if the interaction mean 
Square is used as the error, equality of replication of treatments is recom- 
mended, since no appropriate experimental error is possible in a proportional 


factorial [270]. 


ҮШ-3 Designs for p X q X k *** Factorials 


Latin square designs may be used for 2 X 3, 2X 4 2X 5, 2 X 6 2 X 2 
X 3, and 3 X 4 factorials, although the replication may be excessive for some 
of the larger of these factorials. The above factorials plus the 2 X 7,2 X2 X 4, 
X3x 9,2x8,2x9,3xX53X63X1,4X 5, and occasionally larger 
factorial may be designed in a randomized complete block design or as a 
Completely randomized design as conditions warrant. Е ог larger factorials, 
Опе of the schemes of confounding suggested in the following chapters should 
Suffice [324]. 
is For the larger factorials, _ кү 
Pensed wi interactions of som 1 i 
hig агаараа as the estimate of the тшс еггог 
Variance, For example, it was desired to compare three levels of i ө ine д 
two levels of methionine, m, two levels of folic acid, f, and two level в of By, b, 
™ à poultry nutrition experiment. Since only twenty-four M EA ios 
available, a second replicate of the treatments would have to be performed a 
a later date. This was considered impractical. If, from previous pe 
he three-factor interactions and the four-factor interaction could be dnd 
negligible or nonexistent, they may be used as an estimate of the experimen! 


her. е ig mney Б 
tion of the entire set of treatmen 
Bes -factor and the four- and 


TABLE X-2. Number of plants germinating from 100 seeds for each of four seed 
treatments on each of eight guayule varieties in a split plot design of 
three replicates, and totals for the analysis of variance 


Replicate I 


216 


§ X-1.4.1] Whole Plots in a Randomized Complete Block Design 277 


TABLE X-2. (continued). 


Variety and treatment totals 
fe} 


Verieties 


ы бы ee 
199 190 195 251 148. 174 059% 


25 55 38 30 hg ak ш 59 


Whole plot analysis 
Replicate 
Variety = A 


Error (a) 1377-25 


Split or sub-plot analysis 
Treatments = B 
bo vs bi +d +d; 


30774.28 
29829.03 
52.56 


b, vs b, +d 
QE амы? 892.69 


ъ уз b 


AxB 2620.13 
A x bo vsb, +ъ +b 25612 
о 19872 5 632.94 
Axb vs b, + d 
1 2 3 530-48 
Ax by уз bs 


Error (b) 


plus treatment by replicate by variety) sum of 


The erro { by replicate 
г (b) (treatment by гер Ef freedom is equal to 


Squares with 2(3 + 21) = 48 degrees 

36,736.24 — 38.58 — 763.16 — 1377.25 — 30,774.28 — mu = nan 

Since 5 ts among the seed treatments were 0 interest, the 3 degrees 

of ERAS opaca). were partitioned into individual degrees of freedom. 

he b, seed treatment was compared with the mean of the remaining three. This con- 
trast accounted for the major portion of the sum of squares for treatments; thus: 


[3(1340) — 334 — 481 — 274)" = 29,829.03. 
249 4 11-4 D 
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error variance and only one replicate is required. For this case the breakdown 
of the degrees of freedom for the 3 X 23 factorial is 


Source of variation df Source of variation df 
с (choline) 2 3 and 4-factor interactions 9 
M (methionine) 1 CXMXF 2 
F (folic acid) 1 CXMXB 2 
B (Bu) 1 Error }CXFXB 2 
CXM 2 MXFXB 1 
CXF 2 CXMXFXB 2 

СХЕ 2 Total * 23 

MXF 1 Correction for mean 1 

MXB 1 Total d 24 

FXB 1 otal uncorrecte 


If it is not realistic to assume that the three- 
constitute an estimate of the error variance, 
estimate the linear, Cz, and quadratic, C, 
obtained from pooled sums of squares of the interactions of other factors 
with Со may then be used as an estimate of the experimental error. This error 
mean square has 7 degrees of freedom associated with it. Whether or not this 
is a useful procedure depends upon the nature of the treatments in the experi- 
ment and upon the nature of the experimental material used. 


VIII-4 Missing Data 


The estimation of missing plot values in any of the above factorials will 
depend upon the particular experimental design used. If the completely 
randomized design has been used, the method of Chapter IV is recommended. 
In a randomized complete block design a missing experimental unit is esti- 
mated as described in Chapter V. Likewise, for a latin Square design, missing 
data are estimated by the method described in Chapter VI. 


and four-factor interactions 
an alternative procedure is to 
effects of choline. The mean square 


VIII-5 Least Squares Estimates of Effects and 
Expectation of Mean Squares 

The expectation of mean squares and the estimation of effects for a p X q 

factorial in a completely randomized design is equivalent to that presented in 

Chapter V with k individuals per experiniental unit. However, if the pXq 

or the p X q X k factorial is arranged in a randomized complete block 


design, the procedure for obtaining the expectations becomes slightly more 
complex. 


К In factorial experiments the assumption that the p levels of factor апа 

п à E н. is b represent а sample from an infinite population usually 
warranted. For completeness the anal i ite 

and the infinite models e analyses are given for both the finit 
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In Chapters IV to VI the estimates of the parameters, Greek letters, are 
denoted by Latin letters. Since the number of symbols is greatly increased 
in the more complex experiments, this system is discontinued in the following 
chapters. Instead, the estimate of a; is denoted as @;, i.e., the estimate is 
denoted by a hat over the quantity to be estimated. This scheme is used to 
denote the estimate of и in Chapters IV to VI. 


VIII-5.1 THE P X4 FACTORIAL IN A RANDOMIZED COMPLETE BLOCK 
DESIGN 

For the first situation, consider that the p X q factorial is arranged in a 
randomized complete block design [313] and that the linear model, 


Хо = u + po + ai + B; + oi + ei, (VIHI-1) 


represents the yield of the jth level of the factor b and the ith level of the 
factor а in the gth replicate. It is assumed that p is the population mean, 
the effects are independent, p, is the replicate effect, a; = effect of ith level 
of a, B; = effect of jth level of factor b, а8:; = interaction effect of the ith 
level of а on the Jth level of b, and egi; = a random effect. No assumption of 
normality is involved in the estimation of effects or of their estimated vari- 
ances, 

Let g = 1, РА 56111 = 1, 2,4, p; and j -1,2,*:54; let the grand 
total be designated as X..., the gth replicate total as X,.., the ith level 
total of the factor а as Х.;., the jth level total of the factor b as X..;, and the 


Uth total as X... Given the residual sum of squares, 


жо р 1 х ^ 
2023012020655 — 2 — & — & — B; abu = 0, (ҮШ-2) 

9-1 tml jal 2 
the least Squares estimates of the various effects are obtained by differenti- 
ating 9 with respect to each effect, setting each resulting equation equal to 
Zero, and solving for the effects in terms of the observations, 


90 7 A рт abe) 10 
En = —25 33055 eur em ái B; aß 5) 


(ҮШ-3) 


3 = EEL д o4 д aBa) = 0. (ҮШУ) 


abu) = 0. .— (УШ) 


Ш 


` -252:0,5 — & — b A 6 
2257 


90 4, — d; — Bj — 983) = 0. (VIII-6) 
—2 Jio D Ep es й 
2225 oij J 


Ш 


— 4 — Bj — as) = 0. (VIII-7) 


Ш 


—2 (Хш, =A Йй 
2 
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К we set 328, = 226: = DB; = Labi = abs = 0 and solve the 
above equations, we find that 


p = X.../pgr = z, (VIII-8) 

cT. (VIII-9) 

Po Pi BE. 2, 

у= ш шш (VIII-10) 
Ti 

HE Žo ар (VIII-11) 


and 


«Ву = 0 — a - Êi- p= Bay — H.-G. +8, (ҮШ-12) 


The variance of р is 


Е[а — р)? 
= Ey - 
= E| „+ Б» + Eut + Бойы 
О uer ТОП Бей АЫ i 
DIT | о, 1 
ur» Be (VIII-13) 


if it is assumed that Ур; = Dia: = 2.8; = 2 абу = У 085; = 0. Likewise, 
LI J 
the variance of 8, is 


1 1 2 
Ela, 3i ро]? = [ээ т se] 
E for So, = 0. (VIII-14) 


The covariance of any two f's, say f; and f», is equal to 


| Gzr«-irrre(irre-Zrrre)) 


= —c rpg. (VIII-15) 

E чн the variance of the difference between any two f's, or of any 
?p'icate means, is 2¢2/pq; the vari гіз (p — 1)с2/7рд; the 

covariance of any two sis а шу dris (p c лели; 


e — ed /rpq; the variance of any B;is (д — 1)с2/тра; 
the covariance of two B's is —/гра; the variance of diy is(ppg-p-—4 
+1)с2/грд; and the covariance of tw à 


о a's depends upon the two items 
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concerned. For one i subscript in common the covariance between two 
08:8 is —o2(p — 1)/rpq; for one j subscript in common the covariance is 
—o?(q — 1)/rpq; and for no subscripts in common the covariance is +с2/грд. 
Therefore, the variance of а difference between two «878 with one i subscript 
in common is 20(р — 1)/rp; with one j subscript in common the variance 
is 2¢2(q — 1)/rq; and with no subscripts in common the variance is 2c (pq 
— P — q)/rpq. 

In obtaining the following expectations of the various sums of squares, 
it is assumed that the pp, æi, Bj «Bij, and є,:; are random variates from an 
infinite population with mean zero and variances c,?, ca, og, gas’, and cè, 
respectively. Likewise, it is assumed that all effects are additive and inde- 
pendent and that the linear model given by equation (VIII-1) is appropriate. 
As with previous expectations, no assumptions involving normality are re- 
quired. However, the assumption of normality usually comes in when one 
Starts testing hypotheses and utilizing the variance components in other ways, 
such as estimating average genetic advance and heritability. 

The expectation of the total sum of squares corrected for the mean is 
equal to 


b> р. 2 Xi? — x 3] = ЕРУУ + p; + es + Bj abi + eis)? 


1 
— spi "PM + ра, + ara + probs +r Leaky + UU Lew)! 


= pau + o, + oa? + og! + cas? + 92) — (раги? + рас? + are 

+ pros + roast + a2) = par — De + аг(р — Vee! + pra — Dog 

+ r(pg — locas? + (par — 1)e4, (VIII-16) 
With rpg — 1 degrees of freedom. 


The expectation of the replicate sum of squares is 
a| pre! х. 
Pq pqr 


x. 
ш Por + ра, + zo. + PEB + ав + 22e — =: 


= rpqu* + rpqo „2 + гас? + rpag + Гтав? + Тоё 
— E[X..2/rpq] = pa(r — Io? + (r — Dees 
With (r — 1) degrees of freedom. 


(VIII-17) 


The expectation of the sum of squares attributable to the factor a is 


X. X X..2 = — 1)(с2 + reas? + го 2), (VIII-18) 
р> трд ] (p ) Сав 
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with (p — 1) degrees of freedom, and the expectation of the interaction sum 
of squares is 


pass Уу уу х.) 


rg гр rpq 
= (pg — p — 9+ Y) (e? + гов), (VIII-19) 


with (pg — p — q + 1) degrees of freedom. The remainder of the expectations 
are obtained from table VIII-5 and are presented in table VIII-6. 

Аз stated before, the infinite model may be inappropriate for certain 
factorial experiments. "Therefore, suppose that factors a and b have only 
p and q levels, respectively, and that Уе; = 0 = > \8;; this also means that 
2 abi; = 0 = afi. In this case, let Eo? = o2, EB? = B?, and E(af; 


= (aß). With these restrictions on equation (VIII-1) the expectations of 
the sums of squares are obtained as before. However, the form of the expec- 
tation is quite different. From the coefficients of the various components in 
table VIII-5 the expectations of the various mean squares are obtained 
(table VIII-6). In this case, it will be observed that the error mean square 
is appropriate for testing hypotheses about the existence of ? o2, >>, and 
>72 (aß). For the infinite model the A X B mean square is used to test 
hypotheses concerning c? and og. 


VIII-5.2. THE p X q X k FACTORIAL IN A RANDOMIZED COMPLETE BLOCK 
DESIGN 


For a p X q X k factorial arranged in a randomized complete block de- 


sign, we shall assume that the yield of the gijhth experimental unit may be 
expressed by the following linear equation: 


Aon = M+ pa + os + B; + ов + Yn + «ү + Brin + «бүз» + є) 
(УШ-20) 


where ш = population mean, a; = effect of the ith level of factor a, B; = effect 
of the jth level of factor b, «8; = effect of the ith level of factor a at the jth 
level of factor b, y, = effect of hth level of factor с, ayin = effect of ith level 


of factor а at the hth level of factor с, Djs = effect of jth level of factor b at 
the hth level of factor c, Bis = effect of ith level of factor a at the jth level 
of factor b and the hth level of factor c, and egija = the random component of 


error variation associated with the gijhth unit. 

е assume that the ру, œi, Bj, «8, Ym yin, Вүзһ, «ВҮзуһ, and єргуһ ATC 
random variables independently distributed with zero mean and with variances 
j^ Т 59 в » Caf’, Oy, Cay”, сву, савту, and cê, respectively, the expectations 
of the various sums of squares and mean squares are as given in tables VIIJ-7 
and ҮПТ-8 for the infinite model. If, on the other hand, we assume that these 
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are finite populations composed of the particular levels used in this experiment, 


then Уо: = 228; = Dvn = ais = Davin = 2 Brin = 2 ri» = о 
and the expectations of the various sums of squares аге as represented in 
tables VIII-7 and VIIT-8 for the finite model. The error mean square is appro- 
priate for testing hypotheses about the existence of the various effects for 
the finite model but not for the infinite model. 


VIH-5.3 ANALYSIS FOR ALL POSSIBLE SINGLE CROSSES AMONG k LINES 
IN А RANDOMIZED COMPLETE BLOCK DESIGN: 


In genetic experiments, it may be desirable to estimate the variance 
components due to variation among Ё lines of a strain when used as males and 
when used as females, or of crosses and of their reciprocals.? In addition, the 
variation due to the interaction of the ith line crossed with the jth line may 
be of interest. Suppose that the randomized complete block design is used 


and that the yield of the gijth experimental unit may be represented by the 
linear model, 


Xo = u + po + os + B; + аВ:; + єр, (VIII-21) 


where g = 1, 2, ---,r;ijj— 1, 2, --+, k, except that the combinations ij for 
i = j are omitted, resulting in k? — k = k(k — 1) crosses; p = population 
mean; р, = effect due to gth replicate; a; = effect due to the ith line; В; = ef- 
fect due to jth line; «8; = effect due to ith line being crossed with the Jth 
line — nicking effect — effect due to dominance and epistasis; and €gij = random 
component associated with measurement of X,;;. To be certain that the treat- 
ments are understood, they are presented below in tabular form. The coeffici- 
ents in the table represent the number of observations on each of the treat- 
ments, There are no lines crossed with themselves, or selfed. 


Male lines 


"The proof in this section wa: 
reproduced through his courtesy. 

ЗА similar situation has arisen in ps 
other but cannot give the test, to t 


5 developed by C. R. Henderson, Cornell University, and is 


Ychological work where subjects give a test to each 
hemselves, 


һоооооооо нк 


ха: 555 


чёт. 
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TABLE VIII-8. Expectation of mean squares for a p X q X k factorial in a randomized 
complete block design 


Infinite model 


Source of variation аг Expectation of mean square 


Replicate (r - 1) = Pako? 
(p - 2) e TOv * rom. * тко? + rako? 


г 2 2 г г г 
(а-1) Oe" + тазу? + трорү® + тко? + rpkog 


(p - 1)(q - 1) es rogy + тко? 
(k - 2) в°+ TOY * трорү? + таш + грос? 


(p = 1)(k - 1) с 2 + TOY + Tq, y" 


(a - 2)(k - 1) oe + rogy + rpo 
(p - 1)(q - 1)(k - 1) ? 4 ro P 


озү 
(r = 1) (рак - 1) 


Expectation of mean square 


(== 1) UE * рака 2 
(p - 1) о? + rax, ?/(p - 1) 
(a - 1) o? + тркєв */(a - 1) 
(p - 1)(9 - 2) ас? + тк®Е(оё, ‚)°/(р - 1)(a - 3) 
(x - 1) o,? + rpav,?/(k - 1) 
(p - 1)(k - 2) 0? + razZ(ov,.)?/(p - 1)(k - 2) 
(a - 3)(k - 2) a? + rpIE(BYs )°/(а = 1)(х - 2) 
(p - 1)(а- 2) - 1) | a? + xzzz (oov, ,*/o - 3)(a - Ye - 2 
(r - 1)(pak - 1) 


If the equations resulting from partial differentiation of the error sum of 
squares with respect to the various effects are set equal to zero, the following 
normal equations are obtained: 


rh(&— DB--k(k-1)5^5,--r(k—1)024--2B)--r3naBu- X.., (УШ-22) 


6—1) (0+2) + (R-1)(Lat 3282 - X28 — X... (VIII-23) 
r(k— 1 (--8)-- (k—3) Da, trd ÂH) — X... (VIII-24) 


r(k— D G-E8)-- (1) 278, r^ (at) = X... (УШ-25) 
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and 
та + 4 + B; + ав) + Ds = Ху. (VIII-26) 


If we impose the restrictions that 


La = Las = DB; = Labs = Lewy =0,  (VIII-27) 


inj int 
the least squares estimates of the various effects are equal to 
д = X.../rk(k 1) =4, (VIII-28) 
eL coge o 1-29 
ĝo = k(k — 1) 2 = G,.. — #, (У ) 


& = (k -DX4.+ Xi X..]/rk(k — 2, (УШ-30) 


Bs = (E—1X.;-- Xj. — X...]/rk(k — 2), — (VIII-31) 
and 


abi; = 21 —p— 4 — B. (VIII-32) 


The expectations of the various mean squares are presented in table 
VIII-10 and are obtained from the table of coefficients of the various compo- 
nents (table VIII-9). Because of the nonorthogonality of the data the sums 
of squares for A, B, and A X B are in a slightly different form than previous 
Sums of squares, If the method for unequal numbers analysis described by 
Snedecor [273, Ch. 11] were followed, we would obtain this form of analysis. 
€ sum of squares for the A X B interaction is the sum of squares for the 
— k crosses minus the sum of squares due to the @; and 6; constants. 
Likewise, the sum of squares for &; constants is the sum of squares for the 
4 and б; constants, УХ. + 228/X- 5 Tue she sum of vor due to 
fitting the B constants, assuming no 8, клу тк = If the 
other k treatments, i.e., the selfs = афу, were included in the experiment, the 
analysis would reduce to that presented in section VIII-5.1. 


2 


УШ-5.4 ANALYSIS FOR k(k—1)/2 POSSIBLE CROSSES AMONG К LINES 
А RANDOMIZED COMPLETE BLOCK DESIGN 
Some breeding experiments are conducted using the k(k — 1)/2 crosses 
among a. group of k lines. Because the reciprocal crosses are assumed to be 
identical, they are not included in the experiment [111, 278]. If the design is a 
Tandomized complete block, the yield of an individual experimental unit is 
assumed to be represented by the linear equation, 


Хы = w+ po + os + оу + abi + eis (VIII-33) 


Where g —-L2,...nizsj-L2,5skh the double subscript ij runs from 
с=з с *, k; o; = effect due to ith line; о; = effect due to jth line; 


+22 


TABLE УШ-9. Coefficients of components for k? — k treatments іп a randomized complete block design 


Sum of squares 


г? Ix - 1) 


^ 
Sor m TON. d 


zx? /r(k -1) 


mé /т(к - 1) 


TS 


Components (infinite model) 


rk(k - 1) 


2r(k = 1) 
rk 


rk(k = 1) 


TABLE VIII-10. Expectation of mean squares of К? — k treatments in a randomized complete block design 


Replicate 


Among a lines = A 


Among b lines » B 


Q?-x-21)r-21) 


B» 


k(k = 


їй 
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1) 
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4,7 BX T 1) 


x? 


A d 


XN 
3) p(y = 2) 


E Gee 
BY (x = 1) 


rk(k - 1) 
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rk(k - 1) 


2 
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226 Other Factorial Experiments [$ VIII-5.4 


«В; = effect due to the cross of the ith and jth lines; and &;; = random 
component for the gijth observation. The sums of squares and expectations 
of mean squares are presented in table VIII-11, assuming that these lines 
constitute a random sample from a large population. The sum of squares for 
the k(k — 1)/2 crosses may also be obtained as 


E (k(k—1) y 
dal CS CO re 
rek- ', 


(VIII-34) 


with — 1 = (k — 2)(k + 1)/2 degrees of freedom. 


The equations for the various totals are 


k(k — 1) 
2 


E Deu Yo) + re — Dr DE dU E. 

d e (mas) 

r(k — 1u + (k — 1) p + r(k — о: + es + afi) +% Bh = Х.:, 

(VIII-36) 

r(u + o; + a; + ав) + У + н) = X.s, (VIII-37) 

and 

Kem 1) 

T ca) (k – Nw +, Б (ais + єн) = X,... (VIII-38) 


When these totals are used, the expectations are obtained as for previous 
examples. 

In addition, estimates of specific and general combining ability [278] for 
the particular set of k lines used in the experiment might be desired. If so, 
we may assume that 


об» = 0 = Dae (үш-) 
у= 
From equation (VIII-39), —o; = Ye With these restrictions, equations 
ј=1яї 
(VIII-35) and po s now become 
k — 
Gru Dp) + Уе = ree =X...  (VIIIA0) 


and 
(k — 00 + Ую) + r(k — 2)о; nx =X... (ҮШӘН) 


The di k 
е difference (2х. = х...) squared has the expectation: 


ГЕ — 2)? +. ES 
CE D Mag + HE- 06-2 (УШ-42) 


TABLE VIII-11. Sum of squares and expectation of mean squares for k(k — 1)/2 crosses in r replicates of a randomized complete block design 


Source of variation Expectation of mean square 


k(k - 1) Tk(k = 1) 


X 
rh(—= - х...) 
2 


rk?(k - 2) 
Within lines k(k - 3)/2 Subtraction of above from 55 


` (K? =k - 2)(r - 1)/2 as 
rk(k - 1) 
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i eo 
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` Therefore, we estimate @2 as 


ЛЕКЦ уи (e. Le у д (VIII-A3) 
r 


k(k — 2) Xr*k(k — 1)(k — дут 
This is the form of the estimate of the variance for general combining ability 
for the ith line as presented by Sprague and Tatum. Strictly speaking, @ё is 
not a variance as ordinarily defined. 
If we make use of equation (VIII-39), the sum of squares among the 
(k — 1) crosses in which parent i is involved has the expectation: 


E (0 — 2) Х.; — X... — Х..; + 2X.../(k — ЭЗ 


is rE- 3) — 3) 
dg. TERR. (k= т, д. (VII-A44) 


- with (k — 2) degrees of ЧОК 


Considering the (k — 1) lines crossed with the ith line аз a sample of 
lines, the variance for the mean of the ith line is 


Х.; E P ССА Б 1 
Е дечу J - [zs] - (= гё E (УШ-45) 
since Уур; = 0 and р> oj = —a;. 


=1яі 
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Confounding in Factorial Experiments 


IX-1 Confounding—Use and Types 


The number of treatment combinations increases rapidly as the number 
of factors or as the number of levels of each factor is increased. For experi- 
ments containing over ten to twelve treatments, the latin square design 
becomes impractical, and as the number of treatments increases it becomes 
exceedingly difficult to select replicates for a randomized complete block 
design which are relatively homogeneous. Because the variation within repli- 
cates tends to increase as the replicate size increases, resulting in a larger 
experimental error variance, it is desirable to keep block sizes small. In order 
to retain relatively small block sizes for a large number of treatments, only a 
portion of the treatments may be included in a small block. The resulting 
blocks are called incomplete blocks [324]. By a device known as confounding 
[126, 318, 319, 324] the necessity of including all treatments in each block (or 
row and column in a latin square) is side-stepped. The whole block or repli- 
cate is divided into the desired number of incomplete blocks. Then, by remoy- 
ing the variation among incomplete blocks (freed of treatment effects) within 
replicates the experimental error mean square is often much smaller than 
what it would have been for a randomized complete block design. The smaller 
error variance results in more precise comparisons among some treatments. 
The segregation of the blocks within replicate sum of squares results in a 
decrease in the degrees of freedom associated either with the error or treat- 
ment sums of squares or both. This means that, in some cases, information 
among some treatment contrasts may be mixed up with or confounded with 
the incomplete block differences. If a particular treatment comparison is of 
little or no value, then this feature may be included in the incomplete block 
design. Indiscriminate confounding may result in complete loss of information 
f greatest importance. This means that the 
experimenter should confound with incomplete block differences only those 


contrasts or comparisons of little or no importance. 
If an effect is of little or no interest, it may be confounded with the in- 


known as complete confounding. For example, it may be desirable to have in- 
complete blocks of four plots in a 23 factorial experiment with three repli- 
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cates and the effect ABC may be of little or no importance. The treatments 
in an incomplete block will be one of these two groups: 


(ABC), (АВС), 
001 000 
010 011 
100 101 
111 110 


"Thus, the comparison between the incomplete blocks is also the comparison 
between (ABC); and (АВС). 

If an effect is confounded with incomplete block differences in replicate 
I, a second effect in replicate IT, and a third effect or one of the first two in 
replicate III, these effects are partially confounded with incomplete block 
differences; i.e., they are confounded with incomplete block differences in 
some replicates and unconfounded in others. Some information is available 
on all comparisons, but some comparisons are more accurately determined, 
since they are made in all replicates instead of being made in only a portion 
of the replicates. 

The term balanced confounding has been defined as the incomplete block 
design for which all effects of a certain order, say all two-factor interactions, 
are confounded with incomplete block differences an equal number of times 
[60, 324]. In the present book, this type of design is defined as a partially 
balanced one, and the confounding is denoted as balanced partial confounding. 
If all two-factor and three-factor interactions are confounded with incomplete 
block differences, the design is still partially balanced. Likewise, if all effects 
are confounded with incomplete block differences an equal number of times, 
the design is said to be balanced. The addition of the term partially balanced 
is deemed necessary for the discussion of a group of incomplete block designs 
which are known as lattices [see Chapter XI; 177]. 

On the other hand, if all effects of a certain degree are confounded an un- 
equal number of times, the resulting design is denoted as unbalanced. The 
type of confounding is denoted as unbalanced partial confounding. In several 
of the factorial experiments in which partial confounding has been used, the 
additional computations are not difficult. However, with certain other in- 
complete block designs, additional computations and complexities are en- 
countered with unbalanced arrangements. 

At this point, it should be noted that the experimenter may use more 
е rather than use confounding. The resulting standard error of a 
ad Miro mem DRY Bs made as small as one wishes simply by obtaining 
cd iba MES e s. In Home experiments (e.g. laboratory), it may be less 
Иске, е А сыг a randomized complete block design than 
addu rire s ofa confounded arrangement and to perform the 
ао оа ооой Ѓог the analysis. The experimenter should 

when designing an experiment, 
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IX-2 Advantages and Disadvantages of Confounding 


In order to utilize confounding and confounded arrangements in factorial 
experiments in an efficient manner, it is necessary to comprehend the purpose, 
the advantages, and the disadvantages of confounding. The purpose of con- 
founding is to stratify the experimental material into homogeneous subgroups 
because the material is variable, because only a portion of the treatments 
can be carried out in a relatively homogeneous time interval or by a single 
technician, or because of some other experimental condition. It is desirable 
to eliminate extraneous sources of variation wherever possible. In some in- 
Stances the extraneous variation may be removed by covariance and in others 
by stratification. The randomized complete block design may have excessive 
variation within the block unless some subgrouping is practiced. The number 
of treatment combinations may be too large to use a latin square. Hence, the 
next plausible thing to do is to subdivide the complete block into incomplete 
blocks containing only a portion of the treatments. The particular treatments 
allocated to an incomplete block are determined by the nature of the experi- 
mental material and the relative amount of information desired for the various 
Contrasts among the treatments. In many cases, removal of the variation 
among incomplete blocks within a complete block greatly reduces the residual 
Variation. However, if the experimenter has no idea of the amount of reduction 
to be obtained, no realistic decision can be made concerning the question of 
confounding. To obtain information on this, results from previous experi- 
ments may be summarized to estimate the gains due to confounding [319]; 
also, uniformity or blank trial data may be analyzed to compare the vari- 
ability within incomplete blocks and the variability among incomplete blocks 
within replicates. For either method, it is recommended that several sets of 
data be available. In summarizing results from previous experiments, it is 
recommended that thirty to forty experiments be the minimum number for 
reliable estimates even though fewer, say fifteen to twenty, experiments will 
Provide useful information. The optimum number of uniformity trails to use 
depends upon the differences among trials, but it is recommended that the 
experimenter use at least ten and that he compare the layouts in two perpen- 
dicular directions, 

The disadvantages of confounding are the following: 


(i) the confounded contrasts are replicated fewer times than are the other 


^. contrasts, : 
Gi) the calculational procedures are usually more difficult, and ИР 
(iii) considerable difficulties are encountered if the treatments interact with in- 


complete blocks. 


No comparison should be completely confounded unless there is consider- 
able evidence to show that the comparison is of little or no value or that the 
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effect is nonexistent. The experimenter should ascertain the relative impor- 
tance of the various contrasts and partielly confound the effects of least 
importance. 

The additional computations involved depend upon the type of confound- 
ing used. The experimenter will be well advised to set up the analysis prior 
to running the experiment. From among the suitable confounded arrange- 
ments, he should select the simplest computationally. The time saved on 
computations can usually be well spent upon the interpretation of the ex- 
perimental results. Also, the experimenter may save himself considerable time 
and anxiety if he writes out the analysis prior to conducting the experiment. 


IX-3 Complete Confounding 


A particular contrast may be of little or no interest and it may be decided 
to confound the contrast with differences among incomplete blocks. How- 
ever, before doing so, the experimenter should be certain that this is desirable, 
and he should consider the available comparisons among the treatments. 

One further point arises concerning the analysis of experiments in which 
one or more effectsarecompletely confounded with incomplete block differences, 
and that is the adjustment of treatment means. If il can be assumed that the 
completely confounded effects are equal to zero, then adjustments should be made. 
However, it is felt that this is not usually a valid assumption and that adjust- 
ments are unnecessary. If the assumption that the confounded effect equals 
zero is incorrect and if the treatment means are adjusted assuming a zero 
effect, the resulting means are biased. The unconfounded effects are not af- 
fected by the adjustment, since they represent within-incomplete block 
comparisons. 

IX-3.1 RANDOMIZATION 


The sets of treatments should be randomly allocated to the incomplete 
blocks with a different random allocation for each replicate or complete block. 
The treatments within each incomplete block should be randomly allotted to 
the experimental units. A different random allotment is used for each incom- 
plete block and for each replicate. If a latin square design is used with con- 
founding, special rules are sometimes necessary. These are described in 
conjunction with the particular arrangement used. 

By randomly allocating the sets of treatments to the incomplete blocks, 
some information is available on the effects that are completely confounded. 
If there is a differential response (see next section) of an effect in some incom- 
plete blocks, this may cause other effects to be overestimated. This is a further 
reason for randomization of the sets of treatments to the incomplete blocks. 


ІХ-3.2 COMPLETE CONFOUNDING IN THE 2* SERIES 


E instances the higher-ordered interactions or even the main effects 
(see Chapter X) may have little or no meaning, and it may be decided to 
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confound these comparisons with incomplete block differences in all repli- 
cates. Ап example of complete confounding of the ABC interaction of a 23 
factorial experiment with incomplete block differences in all three replicates 
is given below. 


Replicate I Replicate II Replicate III 


000 111 
110 010 


101 001 
011 100 


I, Is Ih п, шь ш, 


The treatments in incomplete blocks I», Ie, and IIT, are those which make up 
(ABC); and those in blocks Ia, IIa, and III, make up (АВС). The groups 
are allotted to the incomplete blocks at random, and the treatment combina- 
tions within each incomplete block are allotted to the plots at random. The 
comparison between incomplete block totals in a replicate also represents the 
contrast between (ABC), and (ABC)o. The partitioning of the degrees of 
freedom in the analysis of variance follows: 


Source of variation Degrees of freedom Mean square 
Replicates 2 
ABC 1 — | Е 
Replicates X АВС 2 Ep | 
Treatments 6 

A 1 

B 1 

AB 1 

с 1 

АС 1 

вс 
"Total 23 
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. The contrast of the levels (ABC): and (ABC), is not well determined, 
Since only three replicates are available for the contrast and since the repli- 
Cates X ABC mean square, which is the error term for testing the above 
Comparison, is determined with only two degrees of freedom. Thus, most of 
the information on the ABC effect has been sacrificed at the expense of having 
blocks of four instead of blocks of eight plots. The increase in precision due 
0 confounding (ignoring the difference іп the degrees of freedom for r repli- 
Cates of the incomplete and complete block designs) is obtained from the 

ormula, 

E/ (p—-14 DE + [6+ 6(r — )]E, Еу + 6E. (IX-1) 
Е, [r—14 1-4 6- 6(r — 1)]Е. "270880: 


234 Confounding in Factorial Experiments [8 IX-3.2 


where E; and E, are the two error mean squares in the previous example, 
r is the number of replicates, and the ABC interaction is not assumed negli- 
gible. In the event that the interaction is assumed negligible, the efficiency 
of the incomplete block relative to the randomized complete block design is 
equal to 
E/ _ гЕь-+ [6+ 6(7 – Е, Е, + 6E. (IX-2) 
E. [rt 6c 6(r—D)]E TE ' ё 


where E; is the incomplete blocks within replicate mean square. The experi- 
menter should usually use the former measure of efficiency, since no assump- 
tions are made relative to the magnitude of the confounded effect. 

In the event that the groups of treatments making up the levels (ABC): 
and (ABC), are not allotted to the incomplete blocks at random, it is not 
permissible to partition the 5 degrees of freedom for the blocks sum of squares, 
since the design is systematic. Also, some difficulties may be encountered if 
the various error terms making up the residual mean square are not homo- 
geneous. These have been discussed by Yates [319; 324, p. 32], and others 
[172; 175, sec. 14.5; 60, sec. 6.21]. A differential response of the factor a in 
the incomplete blocks with responses due to the factors b and c results in a 
large BC effect owing to the arrangement of the incomplete blocks. The possi- 
bility of this happening should not be overlooked in experiments, nor should 
it be overemphasized. 

As a further variation of complete confounding of the ABC effect in the 
25 factorial, Yates [324, sec. 8a and 8b] presents an unusual design using com- 
plete confounding and two latin squares. In the first plan [324, table 26], 
Yates gives the systematic arrangement for 2° treatments arranged in two 
4 X 4 latin squares. The ABC effect is confounded with columns and BC with 
rows. As a variant of this design the ABC effect is confounded with squares; 
the four treatments making up (ABC), are put in one 4 X 4 latin square and 
the remaining four in a second 4 X 4 latin square. The two 4 X 4 latin 
Squares represent a random selection of two of the four standard squares; for 


example, the following two squares might have been chosen with four of the 
eight treatments in each square: 


Бшге Square II 
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In accordance with the rules for randomizing latin squares the columns and 
all rows except the first row are randomly arranged. The next step is to inter- 
lace the squares by randomly allotting the members of the first pair of columns 
(or rows) to the first two “columns” (or "rows") and so forth for the remaining 
pairs. The resulting design then has eight columns and four rows; the arrange- 
ment might be of the following form: 


The analysis of variance for the above design is 


Source of variation Degrees of freedom 


Between squares (= ABC) 

Among rows within squares 

Among columns within squares 

Among treatments (А, B, AB, C, AC, BC) 


3+3 = 6 
3+3 = 6 
6 
2 


Other variants of confounding in latin squares are discussed in various places 
leg., 47, 60, 175; Chapter XV]. : 

A 2 factorial experiment may be arranged in blocks of two, four, or eight 
treatments. Likewise, the 25 may be arranged in blocks of two, four, eight, or 
sixteen treatments and the process continued for larger values of п in the 
2” factorial series, If 2" treatments are put into blocks of 2* treatments, then 
(27-* — 1) effects are confounded in each replicate. For complete confounding, 
the same treatments with a different randomization appear together in an 
incomplete block. In most cases, complete confounding is not advisable. If 
only one replicate of a 2” factorial in blocks of 2* is used and the higher-order 
interactions are used for experimental error [324, sec. 7; 60, sec. 6.31; 175, sec. 
14.7; Chapter VII] then one of the schemes of confounding for one of the repli- 
cates as described by Yates [324, sec. 5] and others [11, 60, plans 6.1 to 6.6; 
175, P. 261] may be used. For example, it may be desired to have 25 treatments 
m blocks of 23 plots. If it is desired to completely confound two three-factor 
interactions, say ABC and CDE, then the interaction of these two effects, 


16 
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ABC x CDE — ABDE, which is a four-factor interaction, is also confounded. 
The schematic arrangement of treatments in the four incomplete blocks is 


Block Effect 
^ 1 [00000 00011 11000 11011 01101 01110 10101 10110 | (АВС), (СРЕ), (ABDE)o 
II |00001 00010 11001 12010 01100 01111 10100 10111 | (АВС), (СРЕ), (ABDE): 
ш |10000 10011 00101 90110 01000 01011 11101 11110 | (ABC),, (CDE)o, (ABDE): 


IV |10001 10010 00105 00111 01001 01010 11100 11111 | (ABC), (CDE);, (ABDE)o 
1 


'The four incomplete blocks аге assigned to the four sites at random, and the 
eight treatmer(s within each block are randomly assigned to the eight experi- 
mental unifg in each incomplete block. If the unconfounded three-, four-, 
and five-fictor interactions are used as an estimate of the experimental error, 
the bre;kdown of the 31 degrees of freedom is 


Tu Source of variation Degrees of freedom 
Blocks (= ABC, CDE, ABDE) 3 
Main effects (A, B, C, D, E) 5 
Two-factor interactions 10 
Residual 13 
Total (corrected for the mean) 31 


Likewise, s X s latin squares may be used for 2” factorials in which some 
effects are completely confounded. Yates [324, sec. 8] and Kempthorne [175, 
sec. 15.6] have discussed complete confounding in latin squares and have 
presented some plans [also, see Chapter XV]. 

Incomplete blocks of size 2 have an important place in the biological world. 
A leaf may be divided into half-leaves. The roasts from the left and right sides 
of an animal are naturally paired. The two arms or two legs of a patient are 
naturally paired. Likewise, identical twins in humans and animals form pairs 
of homogeneous experimental units. Since a wide range of experimental 
material occurs in pairs, the confounded designs with incomplete blocks of size 


2 form an important class of designs. For larger blocks the experimenter may 


E mates or other homogeneous experimental material for an incomplete 


Example 1Х-1. To illustrate the computational procedure for complete confound- 


n hy ellie discussed by Yates [319] has been selected. The 2: treatments of two 
m : Е no ч п, two levels of phosphate, poand ру, and twa levels of potash, 
pied Bl КЫ combinations жеге applied to peas. The resulting /yields of peas in 
КИЧЕ. Ra i ү seis in table IX-1. The NPK effect was completely con- 
piat qus E d gan among the incomplete blocks of four treatments. Three 
тр i T i totals, effect, totals, and the analysis of variance are pre- 

7 7016 computations follow those described in Chapter VII for 


TABLE ЇХ-1. А 2! fertilizer experiment on peas (pounds/plot) 


Replicate I Replicate II Replicate III 


(001) 45.5 (оо) Mk. 49.8 (о) (011) 55.2 (ооо) 56.0 
(100) 62.0 (111) 48.8 52.0 (000) (101) 57.2 (110) 59.0 


(ооо) 46.8 (101) 57.0 55.5 (010) (100) 69.5 (001) 55.0 
(011) 49.5 (110) 62.8 59.8 (11) (010) 62.8 (111) 55.8 


Treatment totals 


x x 


000 Х.100 *.o10 Х.110 *.001 Х.101 X. 


HE 


.5 


Iv) 


155.5 191.5 165.0 175.8 156.0 164.0 15 


ЕЕ 2 

g 

P 

ee ee 
жвае |[Ро 

++++++++ 


Е] 
A 


216.1 209.1 


200.5 229.8 


Replicate 177.803 


Es 57.002 

NPK x replicate = Еггог (а) e 

Treatment е? 
Т 189.282 
н 5 
Р 95.202 
E 2128 
E 55.1 
ж 0.482 


Residual = Error (5) 185.285 


Total 


Correction for mean 1 
Total uncorrectea 


Adjusted treatment totals 


' t r 
Xlo Х' aoo Z'o X'no Xon Х'лоу Xuan Xum Total 


158.05 187.575 159.275 177-525 152.275 167-725 155.225 159.375 1317-000 
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the 2? factorial: The only new computations are those for the NPK X replicate or 
the error (a) sum of squares; the error (a) sum of squares is equal to 
(216.1 — 200.5)* 4- (202.1 — 229.8)? + (225.4 — 243.1? (673.4 — 643.6)? 
4(1? + (—1)?) = 8 a 24 
= 128.490, with 2 degrees of freedom. 
The totals are the incomplete block totals from the two-way table of totals for the 
levels of the three-factor interaction and replicates (table IX-1). 

The lsd for the total effects, (N), (P), (NP), (K), (NK), and (PK) is obtained from 
the error (b) mean square; thus, lsd = ts(12df) /2(3)(4)E. = 2.179 4/24(15.4404) 
= 41.95. The two totals (N) = 67.4 and (K) = —47.8 exceed this value. The con- 
fidence interval for an effect total is the effect total minus the lsd to the effect total 
plus the lsd; e.g., the confidence interval for (К) is from —47.8 — 41.95 = —89.75 to 
—47.8 + 41.95 = —5.85. The addition of nitrogen increased the yield, whereas the 
addition of potash decreased the yield of peas. The deleterious effect of potash on the 
yield of peas as well as the beneficial effect of nitrogen requires explanation in light of 
biological knowledge of the physiology of the pea plant. 

The NPK effect is subject to a different error than the remainder of the effects. The 
lsd for the total for the NPK effect is tos(2df) /2(3)(4)E, = 4.303 4/24(64.2450) 
= 168.96. The NPK effect is small in comparison to its lsd, and the effect is assumed 
to be negligible. The confidence interval for (NPK) is from 29.8 — 168.96 — —139.16 
to 29.8 + 168.96 = 198.76. 

The efficiency of the incomplete block design relative to the corresponding ran- 
domized complete block design is computed from formula (IX-1) corrected for the 
difference in degrees of freedom: 


64.2450 + 6(15.4404) (8 + 1) (is + 3) ii E 
7(15.4404) 12 + 3/ \14 +1 


or a gain of 43 рег cent in efficiency. Approximately four replicates of an incomplete 


Ae design would be equivalent to six replicates of a randomized complete block 
esign. 


А If the NPK effect is assumed equal to zero, the treatment totals are adjusted for 
differences in yield of the incomplete blocks by the formula, 
Xy = Mi (В) + (AB); + (Onr + (AO + (ВО q Х.... _ 6Х... 
4 8 


8 
1 

= 51X C0) + (HB) + (АВ) + (101-00) 

+ (-D***(40) + (—1)++МВС) + of (IX-3) 


Е F 
or example, the adjusted total for treatment 010 is equal to 


X.a = 9248 + 651.4 + 669.8 + 682.4 + 644.4 + 656.8 5(1317.0) 
4 8 


We = 159.275. 

е remaini Ч 

subtracting S EA SE are obtained similarly, or they may be obtained by 
9m all the treatments having a plus sign in the NPK row and 


ts follow the si f 4 š " 
treatment is a Sd ES енен Х лоо. No immediate explanation 


"The signs of the eff. 
for the large yield of Шы 
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adding 29.8/8 to the remaining treatments. The effect totals are unchanged. The 

NPK effect should be omitted from the table of effects [324, p. 20]. Again it is em- 

phasized that these adjustments should be made only if NPK is assumed equal to 

d Ordinarily, these adjustments are not made because it is unsafe to assume 
= 0. 


ІХ-3.3 COMPLETE CONFOUNDING IN THE 3" SERIES 


| Тһе 3? factorial may be arranged in incomplete blocks of three treatments 
with either the AB effect or the AB? effect confounded with incomplete block 
differences; thus, the arrangements are 


If one or the other grouping is followed in all replicates, this effect is com- 
pletely confounded with incomplete block differences. If AB and АВ? are 
both estimates of the same quantity, partial information is obtainable on the 
A X B interaction even if one of the components is completely confounded. 

If AB is confounded in all replicates, if the levels of AB are randomly 
allotted to the incomplete blocks in each replicate, and if a different random 
allotment of the treatments within each incomplete block is made, the analysis 
of variance is 


Source of variation df Sum of squares 
-1 X,.2/9 — X...2/07 


Replicat r- - 
AB CC 2 (AB)in?/3r — X..2/9r 
du X replicate = error (a) 2(r — 1) by subtraction 
2 

E 2 see Chapter VII 
AB: 2 | 
Residual = error (b) 6(r — 1) by subtraction 
Total (corrected for mean) 9r— 1 LUV Xai? — X..2/9r 


E, is used to test the existence of the AB effect, 
are is the error mean square for effects A, B, 
ete block design relative to the cor- 
design is (ignoring the difference in 


e error (a) mean square = 
ME the error (b) = E, mean squ 
rec. AB*. The efficiency of this incompl 
sponding randomized complete block 
egrees of freedom) 
[2(> — 1) + 215 + [6+ 60 — DIE _ А + 3E., IX 
1) 6216467 DIE. ДЕ, (En 
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Jf desired and if realistic, the treatment totals may be adjusted to zero AB 
effects by the formula, 
Xai a (А): + B (АВ гы; |. — 


X. 
usd (IX-5) 


As with the 2* series, this adjustment is not applicable if the AB effect is not 
equal to zero. The standard errors for the effect levels are obtained as before, 
except that E, is the error mean square for levels of A, B, and АВ?, while 
E, is the eiror for the levels of AB. 

The 3? factorial arrangement may be designed in incomplete blocks of 
three or of nine treatments. For example, suppose that only litters of size 3 
(or three homogeneous roasts) are available, and it is desired to compare 33 
treatments. It is necessary to confound 8 degrees of freedom with incomplete 
blocks. Therefore, suppose that we wish to confound AB, AC, and their 
interactions, AB*C? and ВС?, The following nine sets of three treatments are 
obtained л 


Treatments Level of effect 
000 122 211 (AB), (AC), (ВС:)„ (AB:C3), 
000 120 212 (АВ), (AC), (BC. (АВ?С2), 
002 121 210 (AB), (AC), (ВС), (АВ2С2), 
010 102 221 (AB), (АС), (BC), (АВ?С2), 
01 100 222 (AB), (AC), (BC), (АВС), 
012 101 220 (ABA, (AC), (BC? (AB?C?)o 
020 12 201 (AB), (AC), (BC2; (AB:C5) 
021 10 202 (AB), (AC), (ВС), (AB?C?)o 
022 11 200 (AB), (AC), (BC?) (AB'C?), 


= ee ee 


If the sets of t; 

Shin erat HH are randomly assigned to the nine incomplete blocks 
QUERI ur the por effects are confounded in all replicates, and 
analysis of variance is each incomplete block are randomly assigned, the 


1 pai 
ааб = 2 ABU = AB*C? + ВС? 
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Source of variation Degrees of freedom ms 
Replicate > т—1 
АВ 2 
АС 2 
BC: 2 
AB?C? 2 
Error (a) 8(r — 1) Es 
A 2 
B 2 
с 2 
АВ? 2 
AC? 2 
BC 2 
A XB x € — AB*C? 6 
Error (b) 18(r — 1) E 
"Total (corrected for the mean) 27r — 1 


The 3* factorial may be arranged in an incomplete block design with in- 
complete blocks of three, nine, ог twenty-seven treatments. The 3° factorial 
may be arranged in an incomplete block design with three, nine, twenty- 
Seven, or eighty-one treatments per incomplete block, depending upon the 
nature of the experimental material and upon the information desired on 
the various interactions. 


IX-3.4 COMPLETE CONFOUNDING IN OTHER FACTORIALS 


. Other p^ factorials may be arranged in incomplete blocks of p* treatments 
in the same manner as described for the 2" and 3” series [177]. It is also pos- 
Sible to arrange other factorial combinations like the k X p" X 4° in various 
incomplete block sizes [175, 194, 215, 217, 223-5, 228, 324]. As an illustration 
of complete confounding in a k X p" X 9" factorial, a 2* X 3? factorial arrange- 
ment of treatments in a time and motion study has been selected. Two weights 
9f a milk pail, two types of pail, two methods of carrying the pail, two heights 
9f pouring, three distances of carrying the pails, and three types of path 
(straight hall, a hall with one right-angle turn, and a hall with stairs in it) 
Were used in all combinations, resulting in 144 treatments.! Six boys were 
available to carry out the experiment. It was expected that they would differ 
in the characteristics observed. Therefore, the six boys were the incomplete 
blocks and each boy would perform twenty-four treatments. Also, it was sus- 
Pected that none of the interactions were of any appreciable magnitude. 
Various schemes of confounding could be used. The forty-eight treatments 
could be conducted by two boys at each of the distances (or types of path). 
his completely confounds a main effect. Also, one could use four boys and 
have each boy do thirty-six treatments. One particular scheme would be to 
Completely confound effects ABC, AD, and BCD with differences between 


тот а study Ьу W. Н. M. Morris, Cornell University. 
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boys. Alternatively, if each boy carries out twenty-four treatments, one effect, 
ABCD, could be completely confounded and some effects, EF and interactions 
with EF, could be partially confounded with the differences between boys or 
incomplete blocks. The plan and breakdown of the degrees of freedom for 
the latter design are presented in table IX-2. Since the experiment could be 
performed in less than one hour and since it was desired to utilize the services 
of all six boys, the experimental arrangement described in table IX-2 appeared 
to be the most desirable for this study. 
IX-3.5 APPLYING TREATMENTS TO STRATA WHICH ARE DESIGNED TO 
CONTROL HETEROGENEITY 

One consideration that occasionally escapes the attention of the experi- 
menter is that the purpose of stratification of experimental material into 
homogeneous subgroups is to control extraneous variation. If treatments are 
applied to the strata, then the meaning of the experimental results may be 
questionable; for example, suppose that we have a randomized complete block 
experiment of p levels of the factor a in r replicates. Furthermore, suppose 
that the experimenter applies r levels of the factor b to the r replicates. If 
an interaction, А X B, is present and if these levels represent the levels of 
interest, no valid estimate of the experimental error is available. The analysis 
of variance (see Chapter VII) is 


Source of variation df Expectation of mean square 
Replicate and B effect p сё + £(B effect and replicate) 
A p-l сё + f(A) 

Residual (error and A X B) (г– 1)(р 1) сё +f(A X B) 
Total тр—1 = 


where f (B effect and replicate) is a function of the factor b effects and the 
replicate effects, f(A) is a function of the factor a effects, and the f(A х В) 
is a function of the interaction of the factors a and b. It could turn out that 
the residual mean square is significantly larger than the other two mean 
squares. If there is a block X treatment interaction, the same effect could be 
observed. The experimenter must know his material and ascertain whether 
or not an interaction is present and whether or not this affects the tests of 
his hypothesis; i.e., whether the finite or infinite model is assumed. 

The same comments hold for the latin square. Suppose that we have three 


els of the factor a and that we apply three levels of factor b to the rows of 
a3 X 3 latin square; thus: 


lev 


„ 


TABLE IX-2. Experimental plan for a 2* X 3? factorial in incomplete blocks of 3 х 2° 
= 24 treatments 


Block III Block IV Block V Block VI 


Block I 


Block II 


i 000000 000010 000080 000100 000110 
2 000001 000011 000021 000101 000111 000121 
3 000002 000012 000022 000102 000112 000122 
4 001110 001120 001100 0c1010 001020 001000 
5 001111 001121 001101 001011 001021 001001 
6 001112 001122 0011% 001012 001022 001002 
7 010120 010100 010110 010020 010000 010010 
8 010121 010101 010111 010021 010001 010011 
9 010122 010102 010112 010022 010002 010012 
011000 011001 021002 011100 011101 0113100 
011012 011010 011011 oni 011110 oni 
011001 011022 011020 011121 011122 011120 
100100 100110 100120 100000 100010 100020 
100101 100111 100121 100001 100011 100021 
100102 100112 100122 100002 100012 100022 
101010 101020 101000 101110 101120 101100 
101011 101021 101001 101111 101121 101101 
101012 101022 10100 101112 101122 101102 
110020 110000 110010 110120 110100 110110 
110021 110001 110011 110121 110101 110111 
110092 110002 110012 110122 110100 110112 
111101 111102 111100 111001 111002 111000 
111110 111111 111112 111010 111011 111012 


111122 111120 111121 111022 111020 111021 


Analysis of variance 


Source of variation 


AxF 

BxC 

BxD 

BxE 

BxF 

CxD 

CxE 

CxF 

DxE 

DxF 

EF? 

ЕЕ! (partially confounded) 
Residual 


Two-factor interactions 26 
AxB 
Axc 
AxD 
AxE 


И) 


онны 


104 
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If we select one of the twelve 3 X 3 latin squares at random, the following 
analysis of variance is appropriate: 


Source of variation df Expectation of mean square 


Row and B effect 

Column and AB? effect 
Treatment = A effect 
Residual (AB effect and error) 


Total 


c + f(B and row) 

сё + f(AB* and column) 
сё + f(A) 

с + f(AB) 


с һә t5 ho to 


If there is an interaction of the factors a and b and if we are interested in these 
particular levels of the factors (the finite model), no appropriate error mean 
square is available. The same type of comment holds for incomplete block 
and other types of designs. Instead of obtaining more information, the experi- 
menter may actually end up with less by having no appropriate error mean 
square for making tests of significance. 


IX-4 Partial Confounding 


Partial information may be desired on all effects in a factorial experiment 
with a large number of treatments, and at the same time a small block size 
may be desirable. À scheme of confounding may be followed in which some 
effects are confounded in one replicate, other effects in the second replicate, 
and still others in the remaining replicates. The scheme of partial confounding 
allows for within incomplete block information on all effects. 

"The treatment means are always adjusted in partially confounded arrange- 
ments. The reason for adjustment is that estimates of effects are obtainable, 
and, hence, the effect of the incomplete block is estimable. The treatments 
within an incomplete block receive a negative adjustment if the incomplete 
block effect is positive and a positive adjustment if the incomplete block 
effect is negative. These adjustments tend to remove the mean effects of the 


Fior pete blocks, resulting in a more reliable estimate of the treatment 


IX-41 PARTIAL CONFOUNDING IN THE 2» SERIES 
SEL oo experiment with four replicates, information may be 
PL ne effects, It may be considered that blocks of eight plots 
efficient. The ape geneity and that blocks of four plots would be more 
interest on the A » Y 5 B, and C are of most interest, with the secondary 
of any of the interactio; C, and ABC interactions. Complete confounding 
be confounded with mos may be undesirable. The interaction ABC could 
9ck differences in replicate I, AC in replicate II, BC 
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in replicate III, and AB in replicate IV. Some such field plan as the following 
might be obtained: 


Interacti 
Раана АВС АС BC AB 


Rep. I Rep. II Rep. III Rep. IV 


where the confounded effect is allotted to the replicate at random and the 
level of the effect to the incomplete block within a replicate at random. The 
treatments within an incomplete block are assigned to the plots at random. 
For such a design the effects AB, AC, BC, and ABC are estimated from the 
three replicates in which they are unconfounded, Thus, the interactions are 
evaluated from twenty-four plots instead of the entire thirty-two plots, as 
Эге the main effects А, B, and C. The amount of information on the inter- 
actions, ignoring interblock comparisons, is then 34. The breakdown of the 
degrees of freedom for the above design is 


Source of variation 
Replicate РЗИН, MEE 
Blocks within replicates (ignoring treatments) 4 E, 

Teatment (eliminating block effects) 1 А 

Main effects from all replicates К : 

Interactions from replicates in which effect is unconfounded 4 
Residual 17 E 
Residual MEM VEN “АЕ x —— ————-— 
Total 31 T. 


UO _ “УТ XUI s 


The gain in precision due to confounding is (ignoring the difference in 
degrees of freedom) equal to 
4E, + (171 + Т)Е, Ei 6E, (IX-6) 
(4% 7 + 17)Е, ТЕ, 

if the confounded effects are assumed to be negligible [324, p. 31]. If not, 

the treatment effects must be removed from E, before insertion in (IX-6). у 
The following relationship among sums of squares holds: blocks within 
replicates (ignoring treatments) + treatments (eliminating blocks) — blocks 
within replicates (eliminating treatments) + treatments. (ignoring blocks). 
he treatment (ignoring block) sum of squares is obtained in the same manner 
as for a randomized complete block design. The blocks-within-replicates 
(eliminating treatments) sum of squares is computed from the above relation- 
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ship, and the corresponding mean square Е, is computed. Since the expected 


value of Ey’ is eê + €— Dus, an unbiased estimate of c2 + ko? is obtained 
from the formula, 
Ey — Е, 
SS Буи, (x-7) 


where r = number of replicates and k = number of treatments in an in- 
complete block. The value Ej" is substituted for E, in equation (IX-6) to 
estimate the efficiency of the incomplete block design with recovery of inter- 
block information relative to the corresponding complete block design; if 
E" is inserted for E, in formula (IX-6), the efficiency of the two designs is 
equal to (4Е + 17E,)/21E,. 

The treatment totals are adjusted by the formula!: 


ХА. (А) (ВО) 


"SC D*(AB')H-(- DAC) (C) (BC? -(- "(4 BO] 


(А) ();4- (0), M (AB) 4; +(AC)) inn + (BO); (ABC) si 
4 3 


р ах) 


where the effects with a prime are derived from the replicates in which the 
effects are unconfounded with incomplete block differences. This type of 
analysis does not make use of the interblock information. The method for 
шл interblock information is described in section IX-4.5. 

If, in the above example, the effect ABC is confounded with incomplete 


block differences in replicates I and IV and if AB is unconfounded, formula 
(IX-8) becomes 


Ху = AUE OY (АВ) + (Су, | (АС), + (ВС) 
3 


d "Pu 
чаа (IX-9) 
они 18 estimated from replicates II and III. The analysis of 
partial confoun mua accordingly. The type of arrangement is unbalanced 
ва зу Cochran and Сох [60, plans 6.1 to 6.6], Barnard [11], 
Benedi m E p mae pent schemes of confounding for 2” factorial 
confounding are balanced ; reatments. A number of the schemes of 


tion by amount of intrablock information, e. g LN 
» e.g. 34 чу a 
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set of arrangements for 2‘ treatments in blocks of 2? plots [324, table 19] in 
four replicates results in the following: 


Replicate Effects confounded 
I AB, CD, ABCD 
II AC, BD, ABCD 
ш AD, ABC, BCD 
IV BC, ABD, ACD 


All two-factor interactions and all three-factor interactions are confounded 
with incomplete block differences in one of the replicates. Thus, three-quarters 
of the total information, ignoring interblock information, is obtained on the 
two- and three-factor interactions. One-half of the information is obtained 
оп the four-factor interaction. 


Example IX-2. An experiment was conducted in the greenhouse at Ames, Iowa, 
to determine the effect of two types of soil (Бо = soil mixed with sand and b; = soil 
with compost added) and two levels of soil moisture (co — dry soil and c; — wet soil) 
9n the yields of hay from two soybean varieties, ao and a;. The scheme of confounding 
18 the same as for the design in the first part of this section. The three two-factor 
interactions and the three-factor interaction are each confounded with incomplete 
block differences in one of the four replicates; the scheme of confounding is balanced 
Partial confounding. 

The analytical procedure for the first part of the analysis follows that for the 
2° factorial in a randomized complete block design (see example VII-2). In addition 
to computing the effects over all replicates, it is necessary to compute the interactions 
from the replicates in which these effects are unconfounded with incomplete block 
differences. The AB’ effect total (the prime is used to signify that full information is 
Not available on the effect) may be computed directly from replicates I, III, and IV 
(AB is confounded in replicate II), 355 — 334 = 21, or from the total effect minus 
the difference in the block totals from replicate II, (AB)o — (АВ): — (II, — Is) 
= 481 — 464 — (126 — 130) = 21, or the AB effect may be computed in each repli- 
cate in which it is unconfounded and summed; thus, (106 — 112) + (108 — 104) 
+ (141 — 118) = —6 + 4 + 23 = 21 (see bottom portion of table IX-3). The ad- 
Justed effect totals (AC’), (BC’), and (ABC’) are computed similarly and are given in 
table IX.3, 

The analysis of variance for these data is given in table IX-4. The total, replicate, 
and treatment (ignoring block) sums of squares are computed in the usual manner for 
a randomized complete block design. The treatment (eliminating block effect) sum 
ОЁ squares is obtained by computing the sums of squares for the adjusted effects in 
the right-hand column of the middle portion of table IX-3. The sum of squares for A 
18231401 4- 1 +1 -1+ 1+ 1+ 1+1) = 1667.531. The sums of squares for B and 
Care obtained similarly. The divisors for these sums of squares are identical with those 
described in Chapter VII. The sum of squares for AB’ is 212/3(1 -- 1-- 1-- 1-F 1 
+1 1) = 18.375. The divisor is 24 instead of 32, since r = 3 replicates аге used 
to obtain this sum of squares instead of r = 4 replicates. The sums of squares forAC’, 
BC’, and ABC’ are obtained similarly; thus, (—9)2/24 = 3.375, 12/24 = 0.042, and 
24*/24 = 24.000. 
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The blocks-within-replicate (ignoring treatment effects) sum of squares is equal to 


103° + 115? 219 , ||, 143? + 116? _ 259 
4 8 4 8 


(103 — 115)? + (126 — 130)? + (99 — 113)? + (143 — 116)? 
y 8 


= 135.625, with 4 degrees of freedom. 


TABLE IX-3. Yields (ounces) of soybean hay for a 2? factorial in an incomplete block 


Se 


000- 110- 3 011- 31 | 100- 28 101- 31| 1u- E 000- 11 
111- 39 10l- 51 | 000- 19 101- 36 | 001- 24 010- 19| 100- 31 011-29 
100- 30 00l- 21 | 111- 41 010- 30 |110- 35 000- 13| 00l- 21 101- 33 
oll- 7 110- 39 | 001- 30 100- 33 |011- 26 111- 36| 010-25 110- 5 


Totals 
Differences 
Rep. totals 


Х оло *.110 Х.001 Х.101 
98 155 96 151 


A 


§ IX-4.1] Partial Confounding in the 2" Series 249 


TABLE IX-4. Analysis of variance of data in table IX-3 


Source of variation 


Replicate 

Blocks within replicate 

Treatment (eliminating block) 
A 


ABC 
Residual 


Treatment (ignoring block) 
Block (eliminating treatment) 


70.7 
Total 31 3605.97 
Correction for mean 1 27907.05 


The blocks-within-replicate (eliminating treatment effect) sum of squares is ob- 
tained from the relation given in the preceding section, 135.625 — (2759.72 — 2694.885) 
= 70.79, or from the formula, 

КАВ), — 41.) + КАВ), — ап)? М. [X.... — 4Xn...]? 
К1+1+1+3°) 2k(12) 


+ КАО — 4Ш„]? + (AC: — 4]? _ [X.... — 4Хп...] 
12k 24k 


4 80) = а) + (ВО. = 4 _ [X = 18. 
12k 24k 


+ [ABO] — ату + АВО)» — 45] _ [X-— — Xv хло) 
19k 24k 
(—23): + (—56)? + 65? + 32° + 55° + 18° + (=74)? + (-10* 
_ (= 23)? + (—56)? + 65? + 32° + 55° + 18° + (—74)? + (17)? 
48 
_ (=79)? + 97° + 73? + (—9D* _ 70.79, with four degrees of freedom. 
96 


This sum of squares represents the contrast of the level of the effect in the uncon- 
founded replicates with the level of the effect in the replicate in which the effect is 
confounded. The above method of computing the blocks-within-replicate (eliminating 
treatment effects) sum of squares is the one presented in Chapter XI for lattices. 
The residual sum of squares is obtained by subtracting the block (eliminating 
treatment) sum of squares from the total minus replicate and treatment (ignoring 
lock) sums of squares, 3605.97 — 228.59 — 2759.72 — 70.79 = 546.87, with (31 — 3 
— 7) — 4 = 17 degrees of freedom. The block X treatment sum of squares from the 
randomized complete block analysis is partitioned into two parts, block (eliminating 
treatment) = 70.79 and residual = 546.87. The residual sum of squares may also be 
computed directly. To do this, it is necessary to compute the effects in each replicate 
and to set up tables like those given in the bottom part of table IX-3. The residual sum 
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of squares is the sum of the interaction sum of squares in each of the two-way tables, 
divided by the appropriate divisor: 


60? + 367 + 48? + 87? 231? л 40? + 20? + 20° + 672 147° 
8 


Al +) = 8 32 EE 
2 z 
IE DEP _ ZE = 54687, with3 434342424242 


= 17 degrees of freedom. 


This method of reconstructing the residual sum of squares is quite instructive and 
provides a check on the correctness of the computations of the other sums of squares. 

If it is desired to compute the adjusted treatment totals, formula (IX-8) may be 
used. For example, the adjusted total for treatment 010 is equal to 


Xoo! = (945 — 231 + 147 — 99} + 1(—21 + (—9) — 1 + 24} 
= {(357 + 546 + 423 — 3(945)} + 1(334 + 362 + 363 + 355] = 94.08. 
Yates [324, p. 21] presents another procedure for obtaining the adjusted total, 


Xo | - [SP - a. - 1] - [22 - ar. - mj] 
т [22 - ar, - m] + [450 - av, - 1v]t 


1(—37 à d 
= 98 + "E =й ый 1} = 94.08. 


The signs are obtained by noting whether the particular treatment falls in the plus 
or minus group for the confounded effect. The effects are divided by 3, since they are 
derived from three replicates and the mean effect per replicate is compared with the 
effect in the replicate in which it is confounded. The other adjusted treatment totals 
may be obtained in a similar manner. The adjusted treatment means are obtained by 
dividing by r = 4 = the number of replicates. 

The adjusted effect totals may be reconstructed from the adjusted treatment totals 
if desired. This provides a check on the adjusted treatment total. The values for main 
effects A, B, and C are unchanged. The value for (AB’) is computed as 


4056.67 — 118.42 — 94.08 + 153.83 + 92.08 — 131.83 — 114.17 + 183.92) 
= $028) = 21. 


The plus and Pus signs are obtained from the AB row in table ІХ-3, and the factor 
4 18 necessary because the AB effect is evaluated in only 34 of the replicates. The 

effect totals for AC’, BC’, and ABC’ may be obtained similarly. 

block design relative to a randomized complete 

t, since Еу is less than E,. The design might be 

te block design [see 175, sec. 23.7]. If it were con- 

effects would be made block design, no adjustments to the treatments or 


The standard Е A 
Nous ard error of the main effect totals is equal to :/32.169(16 + 16) = 32.1. 


d 
Td error of the totals for the effects which are partially confounded is equal 
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to V32.169(12 + 12) = 27.8. The average intrablock variance of a difference between 
2j 31 ТЕ : 
| two adjusted means is dtm + a = 25£./42 (from eq. (1X-17) with w = 0). 


IX-42 PARTIAL CONFOUNDING IN THE 3" SERIES 


The 3? factorial may be arranged in blocks of three experimental units 
with different effects confounded in each replicate. Suppose that it is desired 
to confound АВ with incomplete block differences in two of the four replicates 
and to confound AB? with incomplete block differences in the other two 
replicates. (The plan for this scheme of confounding is given in section 
1X-3.3.) Full information is available on the main effects, and one-half 
information (ignoring interblock information) is available on the two com- 
ponents of the interaction. With a random allocation of the sets to the repli- 
cates, of the groups to the incomplete blocks, and of the treatments within 
the incomplete blocks, the analysis of variance is 
Ес ee 

| Source of variation Degrees of freedom ms 
— Ó—AaÓÓÁ—Á— 09 7*0 0 0700 
Replicate 


3 
Block (ignoring treatment effect) 4xXx2=8 E, 
"Treatment (eliminating block effect) 8 


B 
AB' 
AB" 
Residual 16 
A X replicate 
B X replicate 
AB X replicate in which AB is unconfounded 
AB? X replicate in which AB? is unconfounded 


VENN 
B 


оюсу с 


Total (corrected for the mean) 2 


The effects are evaluated from the replicates in which they are uncon- 
founded, The adjusted treatment totals for this particular scheme of con- 


founding аге obtained from the formula 


‚ (c (В); ‚ (АВ) + (АВ ы Х... Е 
ху = BY | Чы Чарчы э ay 


Where the levels of AB and AB? are evaluated from the replicates in which 
these effects are unconfounded with incomplete blocks. The standard error 
9f a difference between two adjusted means is V/3E;/r. The residual sum of 
Squares may be computed directly as for the 2" series. Also, the sum of squares 
for block (eliminating treatment effect) may be computed by either of the 
Methods described in the previous section. 

In the 33 factorial, blocks of either three or nine may be used. Suppose 


17 


252 Confounding in Factorial Experiments [§ IX-4.2 


that only partial information is desired on the three-factor interaction, 
A X B X C, and that blocks of nine are to be used. Furthermore, suppose 
that four replicates are to be used. Then, ABC may be confounded with sets 
of nine treatments in one replicate, ABC? in a second replicate, AB*C in a 
third replicate, and АВ?С? in a fourth replicate. Such a scheme would allow 
blocks of nine experimental units to be utilized and would allow for full in- 
formation on main effects and two-factor interactions and for three-fourths 


information on the three-factor interaction. The analysis of variance table 
for such a design is 


a СЕЕ l-—  S— кш: 
Source of variation 


Degrees of freedom ms 

Replicate 3 
Block (ignoring treatment effect) 8 E» 
Treatment (eliminating block effect) 26 

Main effects (A, B, C) $ 

Two-factor interactions (A X B, A X C, B X C) 12 

ABC' 2 

ABC 2 

AB:C' 2 

ABC? 2 
Residual 70 Е, 
Total (corrected for the mean) 107 


The formula for obtaining the adjusted treatment totals is 
0. J 

Ха = (А), + (В), + (АВ); + (ABP ass + (© (AO ian + (ACen 
+ BO) + (BC?) з} ЕЕ sa! (ABC) зь + (ABC) iso 
+ (АВ!С'у +» + (ABC) 254204 — 5Х... . (IX-12) 


The average variance of a difference between two adjusted means is 43E,/18 
and between two adjusted totals is 344Е,/39. 


їп н ae ie of nine experimental units are not homogeneous, and 
E EE omogeneity it is necessary to use incomplete blocks of 
abet Ho eR al units. This means that 8 degrees of freedom or four effects 

confounded with incomplete block differences in each replicate. In 


order not to confound a i 
Н тат effect, to use fou: i i 
р r replicates, and to obtain 


ible on the two-factor int i win 
Scheme of confounding is suggested : ае аноде the-tolow ae 


AB: AC and interactions in one replicate, 

ABE КО “ a second replicate, 

AEN 3 “ а third ^  ,and 
, AC2, “ а fourth * 
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This scheme confounds each component of the two-factor interactions with 
incomplete block differences in two of the replicates and each component of 
the three-factor interaction in one of the replicates. This results in one-half 
information on the two-factor interactions and three-fourths on the three- 
factor interaction. The analysis of variance for this design is 


———M—— MÀ MÀ e M Л 


Source of variation Degrees of freedom ms 
tii н И c co 27. 
Replicate 3 
Block (ignoring treatment effect) 32 E, 
Treatment (eliminating block effect) 26 

Main effects (A, B, C) 6 
Two-factor interactions (AB', AB”, AC’, AC”, BC’, BC?) 12 
Three-factor interaction (ABC’, ABC”, AB?C’, AB?C”’) 8 
Residual 46 E, 
ge POMMES NR E RENE 05 ERR 
Total (corrected for the mean) 107 


The formula for obtaining the adjusted treatment totals for this particular 
design is equal to 


Ха = 5104), + (B); + (Cn — 4X. b+ SAB) + (АВ) 


4 n 
+ (AC) inn + (АС) а + (ВО) + (ВО) + оу (ABC insp 


+ (ABC), да» + (ABC) iajn + (АВАС) ана |. (IX-13) 
The average variance of a difference between two means is 17E./26. 
Other situations for the 3" series of factorials in which certain effects are 
Partially confounded with incomplete block differences may be treated 
Similarly, 


IX-4.3 PARTIAL CONFOUNDING IN OTHER FACTORIALS 


The р" system of factorials may be arranged in incomplete blocks of p° 
treatments by partially confounding effects of less interest. The procedure 
follows that outlined for the 2" and 3" systems [177]. ч 

Partial confounding of some effects in thek X p" X 0" system of factorials 
Presents considerably more analytical difficulties. A full discussion of this 
type of partial confounding is not attempted herein. For a more complete dis- 
cussion the reader is referred to the writings of Yates [324], Kempthorne [175, 

h. 18], Nair [215, 217], Li [194], and Nair and Rao [223-5, 228]. For the 
Present discussion, partial confounding schemes for the 3 X 2? and for the 
MS factorials in blocks of six treatments are illustrated. 

For the arrangement of the 3 X 2 X 2 factorial in blocks of six with the 

BC and A X B x C interactions partially confounded with incomplete block 
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differences, Yates [324, sec. 13] proposes the following balanced arrangement 
for the treatments a;b;c, = ijh: 


'The main effects are unconfounded in each replicate, since all levels of each 
factor appear in each incomplete block. To observe which effects are confound- 
ed, it has been found instructive to set up the above table in another manner: 


(ВС), (ВС) ao(BC)o а(ВС), а (ВС) а(ВС), 


а (ВС), a(BC) а (ВС), ai(BC)o a(BC)o ai(BC); 
a:( BC), ax BC); 82(BC)o &a( BC). a:( BC): ax (BC), 


In the above form, it will be observed that the BC effect at the zero level of 
factor a (а«ВС or BCao, which is equal to $80)» — (BC);)) is also a between 


incomplete blocks contrast. The BC effect is not orthogonal within incom- 
plete blocks, since either the one or zero level is repeated twice in an incom- 
plete block and the other level appears once. The 3 X 2? is a mixed-primes 
factorial, and the modulo notation described in Chapter VII is not appropriate 
for all three factors but is for the 2? part of the factorial. This fact facilitates 
the construction of partially confounded designs for a factorial arrangement 
of treatments, 


The 2X 3? factorial may be arranged in blocks of six treatments. The 
тошо notation of Chapter VII applies to factors b and c but not to all three. 
E: ES arrangement results in partial confounding of the BC component 

е B X C interaction and of the A X B X C interaction: 


I, 


ъ(ВС» | aBC) | 
а(ВС), 


1 


ao(BC), 
а (ВС), | a(BC), 


II, 


авс) | «(BC | 
ai (BC) a,(BC); 
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which gives the following sets of treatments in the incomplete blocks: 


If an additional two replicates are available, the ВС? component should be 
confounded in the third and fourth replicates, resulting in equal confounding 
on the 4 degrees of freedom for ће B X C interaction and in equal confounding 
on the 4 degrees of freedom for the A X B X C interaction; the arrangements 
are 


III, IHi, ПІ, IV. IV 1У, 
a(BC?) | a(BC2, | a(BC?)} | ao(BC%): | a(BC2» | а(ВС?), 


a (BC?), а(ВС?), а(ВС?), а(ВС?), а(ВС2), а(ВС?), 


А numerica] example of the above design and instructions for calculating the 
analysis of variance table are given by Cochran and Cox [60, sec. 6.19]. 


2 Example IX-3. АЗ X 2 X 2 factorial arrangement of applications of nitrogen and 
clippings in the fall and spring was used by М. L. Peterson [246] to study the effect on 


Yields of hay from alfalfa. The treatments were 
а = no nitrogen, 


а = 60 lbs. of nitrogen applied on March 15, 


аз = 30 Ibs. of nitrogen on Sept. 15 and 30 lbs. on March 15, 


b; = no spring deferment of clipping, 
bı = spring deferment of clipping, 
co = no fall deferment of clipping, and 


сі = fall deferment of clipping 


all combinations. The twelve combinations were laid out in three replicates with 
“complete blocks of six plots. The interactions A X B X C and B X C were partially 
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confounded with incomplete block differences. The specified group of treatments was 
assigned to the incomplete blocks within a replicate at random, and the treatments 
were randomly assigned to the plots within the incomplete block. The systematic 
arrangement of the treatment combinations and the yields of alfalfa in grams per plot 
are given in table IX-5. The treaiments are arranged in a partially balanced incomplete 
block design in such a manner that the B X C and A X B X C interactions are par- 
tially confounded with block differences in each replicate. Interaction B X C is con- 
founded as little as possible in this design. А 
The statistical analysis for the 3 X 2 X 2 factorial experiment on alfalfa yields 

is first made ignoring the confounding (324, p. 39]; i.e., assuming the twelve treat- 
ments were completely randomized within blocks of twelve plots. Most of the calcu- 
lations are required for the analysis of the incomplete block design; also, the p 
may observe the differences in the analyses for the randomized complete block © 
for the incomplete block design. The totals (table IX-6) for the twelve treatments 
are obtained in the usual manner. Summing the treatment totals over the factor а, 
the totals in column 5 are obtained. The four figures in column 6 are obtained as 
follows: 

789 + 771 = 1560; 

947 + 882 = 1829; 

771 — 789 = —18; 

882 — 947 = —65. 


The figures in columns 7, 8, and 9 are obtained in the same manner from columns 
3, 4, and 5, respectively. Columns 10, 11, 12, and 13 are obtained by the same procedure 
applied to columns 6, 7, 8, and 9, respectively. For example, the figure 15 in column 
13, which is the B effect = (B) — (B),, is obtained from column 9 as 85 4- (—70) 
= 15, or as —83 — 156 + 254 = 15. Several checks for the computations in the top 
portion of table IX-6 are available. The sum of each of the columns 2, 3, 4, and 5 
should equal the first figure of columns 10, 11, 12, and 13, respectively. Likewise, 


the sum of the numbers over the factor a serves as a check for the figures obtained by 
the sum and differences. 


The four tables at the bottom of table IX-6 m 
A X C, B X C, and A X B X C interactions, 
the top part of table IX-6. The sums of squar 


ay be used to compute the A X В, 
or they may be computed directly from 
es of the main and interaction effects are 


A; Alot + (A)? + (A)? _ Х...2 _ 3389? + 6296? + 60902 — 15775? 
“Se ae = = 


36 12 gg = 438,569.06. 
; B? + (B)? X? (В), — (В): is 

B: 18 3677 36 = 36 = 625. 
„(Оо + (O? _ xc - 2 2 

ве (Ог _ нен =O E шй GOES M 


A x B; (Ва)? + (Ba + (Ba)? (py 
12 36 


= (= 83)? + (—156)° + (254): 152 
pat 20 = = = 7,972.17 


152 


TABLE ІХ-5. Sysiematic arrangement of yields (grams) from a 3 X 2 X 2 factorial experiment in three repli- 
cates with incomplete blocks of six plots (The interactions B X C and A X B X C are partially 
confounded with block differences) 


Replicate I Replicate II Replicate III 


Block number Block number Block number 


Block total 


Rep. total 


Grand total 


"Treatments are designated by the subscript ijh of the treatment combination ась, where i = 0, 1, 2; 
j =0,1l;andh = 0,1 


Yield. : 


TABLE IX-6. Treatment combination totals and calculations for estimating the main effects and interaction effects from the 3 X 2 X 2 factorial 
experiment and alternative tables for computing the interaction sums of squares (Treatment combination а; сь = ijh) 


7190 = (С) 


3389 = (A)g “6296 = (A), 


Sums and differences Effects 
E Sum a а. 
2830 7190 
6585 


Yielà (grams per plot) 
а, ар 


о 

2800 
3496 
-60 
-96 


Sum 


6090 15775 = total 
(8) 
(c) 


(вс) 


25h 


15 
1295 


4350, 


6090 = (A), 15715 = total 


7880 = (в) 
1895 = (В), 


8585 = (С), 


Total 3389 = (А) 6295 = (А), 6090 = (A) | 15775 = total 
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TABLE IX-7a. Analysis of variance for the 3 X 2 X 2 factorial experiment of table IX-5, 
neglecting the effects due to confounding 


Source of variation 88 
Replicate 199964 .06 


Treatment (ignoring block) 513366 .32 
1905627 06 


xB 
xc 
xc 


(uncorrected) 
AxBxC (uncorrected) 


Residual ; 89125.26 
80453.64 
6912517 -36 


T(15971.00 


Source of variation 


Replicate 19996%.05 99982 .05 
Blocks within reps. (ignoring trt.) 16762.75 5587.58 


Treatment (eliminating block) 515550.72 = 
21528.55 
6.25 
5056.25 
5986.08 
3815.08 
3655.12 


corrected) 610.85 


x С? (corrected) 
5809.27 


A хс; (Cao)? + (Сај? + (Ca)? _ (O* _ 269° + 6962 + 430° _ 1895" _ 7,750.17, 


12 Case 12 36 
2 2 Х..2 [(BO»— (BOT _ 
B X C (uncorrected): (вои OX === - [B0 60r. m = 661.36. 


2 + (BCay)? + (BCa2)? _ (BC)? 
A X B X C (uncorrected): (BC) t (BO E e ~ 86 


_ (—41)#+(—36)#+ 238? 159 _ 4,345.06. 

= 12 36 
The sum of the above sums of squares = 513,366.32. As a check the sum of squares 
for treatments is 


Tost + 1430 +. + 1700 + ES K. ишге = 513,366.31. 
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The total, replicate, and error sums of squares are obtained in the usual manner. The 
analysis of variance, neglecting the effects due to confounding, is presented in table 
TX-7a. 

The sums of squares for BC and A X B X C contain some effects due to block 
differences. The next step is to eliminate the block effects from the estimates of (ВО), 
(BO), (BCa), (BCa), and (ВСаз), and from the corresponding sums of squares. 
These sums of squares, added to those for effects A, B, C, A X B, and A X C, yield 
the treatment sum of squares adjusted for block effects (table IX-7b). Yates [324] 
adjusts the BC effect by adding a correction to the unadjusted effect; thus: 


30 = 3[BC(unadjusted)] + Ij — Ia + II, — Ia + III; — Ш, 
= 155 + 2454 — 2300 + 155 + 2246 — 2260 + 155 + 3468 — 3037 
= 3(155) + 154 — 14 + 421 = 1026. 


The adjusted sum of squares for BC is 10267/288 = 3655.12. 

Before proceeding further, an explanation of the divisors, etc. is in order. Tu the 
first place, the quantity Q is unaffected by block differences. This is apparent from the 
following, where the treatment yield is designated by the treatment number and the 
replicate number by I, П, and Ti: 


3Q=3(BC) (unadjusted) -- E; — Iz-- Ts — U+ — IT, 
= 3(000-1-011--100--1114-2004-211 —001—010—101—110—201— 210) (I-- TI --TIT) 
--(000-L-011--101-I-1104-201-4-210 —001—010—100—111—200—211)(I) 
--(0014-010--1004-1114-2014-210—000 —011—101—110—200 —211)(II) 
-- (001-1-010-1-101-.-1104-200-.-211 —000—011—100— 111—201 —210)(IIT) 


= [000(31--31I 4-3III--I— II — IIT) 
3-011(31--311 4-31IIH-I — IT — Ш) 
+100(3I1+-31f+ 31 —I+ IT — Ш) 
+111(31+-311+ 31 — I+II- HT) 
+200(31+ 311+ 301 — I— IT -- HIT) 
+211(31+ 311+311—I—II+IID)] 
—[001(31+3I1+3111-+I— II—III) 
-FO010(3I--3II--31III--I— IT— IIT) 
+101(31-+311+ 3111 — I-II — IIT) 
TEHO(3I2-3II 4-301HI — I+II — Ш) 
+201(31-+311+ 3111 — I — IT4- IHT) 
+210(31+ 311+ 3111 -I —II--IIT)] 


= 1000(41--2114-2111) +011(41+-211+ 2111 
+100(21-4 411-4 2111) 4111214 411 +2) 
КОШ aU 4111) +211 (21-4211 +41) 
seen OTEA + 010(41-+-211-+ 2111) 
SN +П+-2Ш)-++110(21-Е411--әШ) 

( L-E2TI-EATIT) 1-210 (21--2TI--4TIT] 

= (BC), adjusted — 
From the above, it is 

QUOI. apparent that the quantity 3Q is the difference between two 

5 composed of 48 plot yields. Therefore, the quantity 


(BC); adjusted = (BC) adjusted. 
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30/48 = 0/16 is the BC effect adjusted for the effects of incomplete blocks on a single 
plot basis. The divisor for the sum of squares of the quantity Q is (2(6)/3*) (£ + 2? + 
22) = 32, and for 3Q is 2(6) (4? + 2? + 2?) = 288. The sum of squares for the BC effect 
adjusted for block differences may be compared with the BC effect and sum of squares 
from the unconfounded experiment. The BC effect, (unadjusted) on a single plot basis 
is (BC)/18 = [(BC)o — (ВС):]/18, and the sum of squares is [(BC); — (BC);]*/36. 
The error variance for the BC effect for the unconfounded experiment is 2E,/18 
= E,/9 and is 2E,/16 = E,/8 in an experiment with the confounding as described 
above. The loss in information due to confounding, if there is no reduction in the error 
variance, is 1 — 1/9/1/8 = 1 — 8/9 = 1/9, since the relative information from the 
two designs is E./9/E./8 = 8/9. If there is a sizeable reduction in the error mean 
square, then the BC interaction may be estimated more accurately in the incomplete 
block design than in the randomized complete block experiment, even though it is 
partially confounded with incomplete block differences. 

The estimate of the A X В X C interaction is obtained similarly. This interaction 
is expressible as the variation of the BC interaction over the three levels of the factor 
а. The notation (ВСа;) means that the BC interaction is estimated at the zero level 
of factor a. In a like manner, (BCa;) and (BCaz) represent the BC interaction at the 
Second and third leveis of the factor a. The comparison among the three quantities, 
corrected for the mean, yields the A X B X C sum of squares with 2 degrees of free- 
dom. Since these quantities contain block differences in addition to the effect, it is 
necessary to adjust them. The adjusted quantities are 


3R, = 3(BCap) unadj. + Ia — Ty + Ue — II, + III, — Ш, 
= 3(—47) — 154 — 14 + 421 = 112. 

3R, = 3(BCa;) апад). + Ip — Ia + Ha — Ih + Ш -H 

3(—36) + 154 + 14 + 421 = 481. 

ЗА, = 3(ВСаз) unadj. + I — I, + II; — II; + III, — HII; 
= 3(238) + 154 — М — 421 = 433. 


Аз a partial check the sum is equal to 
3R, + 3R + ЗА, = 30 = 112 + 481 + 433 = 1026. 


The adjusted BC effects at the three levels of a on a single plot basis are 
FGR — Q + 3m — Q + 3R: — 0 
= Larz — 026 + 481 — 342 + 433 — 342) 


= —23.0 + 13.9 + 9.1 = 0. 


The coefficient 10 in the denominator may be verified by the process given above 
for adjusting the BC effect. In the unconfounded experiment the BCap effect on a 
single plot basis would be (1/6)(BCa0)unadj. — (BC)/18 = (— 47 — 155/3)/6, the 
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BCa, effect would be (1/6)(BCa;)unadj. — (BO)/18 = (— 36 — 155/3)/6, and the 
BCa effect would be (1/6)(BCaz)unadj. — (BC)/18 = (238 — 155/3)/6, with the 
error variance 2E,/6 = E./3. In this incomplete block experiment the error variance 
of the adjusted effects is V(R.dev.) = (3 X 2/10)E. = 3E./5. Hence, the ratio of the 
relative information on the effects ВСав, ВСа, and ВСаз deviations in the two designs 
is (1/3)/(3/5) = 5/9 if there is no reduction in the error variance. The loss in informa- 
tion is 1 — 5/9 = 4/9. Yates [324] points out that the loss in information on each 
degree of freedom confounded adds up to the single degree of freedom confounded 
with block differences, 1(1/9) + 2(4/9) = 1. This feature is a property of the balanced 
arrangements described by Yates. 
The sum of squares for the ABC effect adjusted for block differences is 


1 2 д 
a9 (3Ro — Q)? + (3В, — Q)? + (38 - Q?] = agi - 230) -+ 139? + 912] = 1341.70. 
The treatment sum of squares adjusted for block differences is the sum of the 
effect sums of squares unaffected by block differences plus the sums of squares for the 


partially confounded effects adjusted for block differences; thus: 


(438,569.06 -+ 6.25 -+ 54,056.25 + 7972.17 + 7750.17) 
4- (3655.12 + 1341.70) = $13,350.72. 


The blocks-within-replicate sum of squares is 


=)? п, il)? = 2 
(I, — Ia) 4 Gh Mo)? , (ID, IIT.) 


2k 2k 2k 
1542, (=)? 42 _ я 
= tag ts = 1676275. – 


The sums of squares are summarized in table IX-Tb. 

The blocks-within-replicate sum of squares adjusted for treatment effect is ob- 
tained from the following relation of sums of squares: blocks unadjusted + treatments 
adjusted = blocks adjusted + treatments unadjusted. Therefore, the sum of squares 
for blocks-within-replicates adjusted for treatments = block unadjusted + treatments 
adjusted — treatments unadjusted = 16,762.75 + 513,350.72 — 513,366.32 
= 16,747.15, with the resulting mean square of 16,747.15/3 = 5,582.38 = Еу. 

i The efficiency of this incomplete block design relative to what it would have been 
ad a complete block design of twelve plots been used is 


3E, —E 
3 =) + Q1 + 19)Е 
= OLEH. _ 9(5582.38) + 57(3809.27) _ 106 А 
+и + 19)Е, 66(3809.27) = per cent, 


or an increase in efficiency of i iti 
founded effects are negligible. 6 per cent. In this case, it is not assumed that the con- 


In the event that it i i d 
into eleven inde; it is desired to partition the 11 treatment degrees of freedom 


d ng 2 
for the A улау constrasts, it is possible to do so. The 2 degrees of freedom 
partitioned into linear and quadratic effects, and the inter- 
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actions of these contrasts with the other factors yield the eleven comparisons listed 

below: 

ЮЫН o  ——————— 
Treatment combinations 


Effect | 499 100 200 010 110 210 001 101 201 01 11 2H Divisors 
М, = 9 .—7 ЫЗ Сой M ME 3(8) 
Ag ТНТ ЕРЕН СЕ с а ee) 
B = o = ые Б ОКК 
c E ааа 
BC + d т = M TM M TE CE 
ALB HELDEN a 
ALG Lo 00 — ы ооа ч 3(8) 
ABC | — о кою ТИИ 
АВ с ац о И NUN е ШЕН 3(24) 
АС I Jz- ЕСЕТ чл у 5e» 
Авс |+ сз € 2 15295 DE EM NS 
Bu ma ia oo o o Lu ——1—— 
Total | i-o КӨ DUDEN СЕСЕ Mond 


where L refers to linear effect and Q refers to quadratic effect and is not the same Q 
as used previously. The experimenter might argue that the above breakdown is 
not realistic and that one should compare a; with az and the mean of a; and аз with 
ao. There is no reason why this cannot be done if the experimenter desires. The co- 
efficients are the same as in the rows of the above table, but the columns are inter- 
changed. 


IX-44 GENERAL METHODS OF ANALYSIS 


The analysis of partially confounded factorial experiments may be gen- 
eralized to a single procedure if one desires. Analyses of incomplete block 
designs appear to differ only because the shortest and easiest computational 
Schemes are usually presented. Rao [255] and Kempthorne [175, Ch. 18] 
Present general methods of analysis for incomplete block designs. It should 
be noted that these methods are applicable to the incomplete block designs 
Presented in the following chapters, although the general method can be 
greatly simplified for a number of the designs. j 

The purpose here is to present the ideas and not the compulational procedure. 
Consider the data of example IX-2 and suppose that we wish to compute the 
Sum of squares for treatments eliminating block effects without resorting to 
the method described previously. The sum of squares may be obtained from 
the formula У)е:;0:;ь. The Pijn аге the treatment effects for the eight treat- 
ments, The method for obtaining the ®у is often difficult. Since we already 
have the adjusted treatment totals in example IX-2, the treatment effects may 

e estimated from the quantity {Ху — X..../8}/r = Фу. The block 
effect has been removed from the treatment means. The Q;;; values represent 
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the treatment total minus the mean of the block totals containing a particular 
treatment. The computations for the data of example IX-2 are 


Total Sum of blocks contain- 


AQiis ELIT 
Treatment | . X. | ing treatment = Bin | = (Xi — Bu) |= Kan EX8) 
000 50 m —244 —61.455 
100 122 489 EST 0.295 
010 98 487 —95 — 24.045 
110 153 470 142 35.705 
001 96 491 —113 — 26.045 
101 131 460 64 13.705 
011 113 462 —10 — 3.955 
Mi 182 471 257 65.795, 
Total 4(945) 0 0 


The 0 D sue = 2694.886, corresponding, but for a round- 


ing error, to the value 2694.885 obtained in table IX-4. This procedure is 
presented for instructive and not computational purposes, since in practice 
we would have to compute the 2л. Quantities like the Qiz» are used frequently 
in the analysis of lattice designs. Kempthorne [175, Ch. 18] and Rao [255] 
describe the method for obtaining the 2; when the adjusted totals, X.ij1’, 
are not available. 

In order to use Rao’s [255] method of analysis, it is necessary to compute 
the parameters of the design. This is done by noting the number of treatments 
= v, the number of replicates = г, the number of incomplete blocks = b, and 
the number of treatments in an incomplete block = k, and by determining the 
associations among treatments in the incomplete blocks. For the design in 
example IX-2 the following treatments appear together in an incomplete 
block in the various replicates (1, 2, 3, 4): 


There are 

block in eee ee т = 3, which appear together in an incomplete 

то = 3, appear idit M T 1; these are first associates. Three treatments, 

these are second associ in an incomplete block in two of the replicates, № = 2; 
lates. Also, only one treatment, n; = 1, appears with 
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any one other treatment in an incomplete block in three of the replicates, 
Хх = 3; these are third associates. Therefore, Rao's first system of parameters is 


о= 8,5 = 8, k=4,r=4 
м = 1, m = 3; № = 2, m = 3; № = 3, m = 1. 


The second system of parameters has to do with associations of other treat- 
ments with a particular pair of treatments. For example, treatments 000 and 
100 are first associates, № = 1. These two treatments have no first associates 
in common. Two treatments, 010 and 001, are first associates of 000 and 
Second associates, № = 2, of 100. There are no first associates of treatment 
000 which are third associates, Аз = 3, of treatment 100. After the second 
associates and third associates of treatment 000 are compared with the 
associates of treatment 100, the following matrix is obtained: 


0 2 0 ри! ps piè 
рў = 1 2 0 1 |= | ps ps ps 
0 1 0 pa? ри pay 


Likewise, if we consider a pair of treatments which are second associates, e.g., 
000 and 110, and a pair of treatments which are third associates, e.g. 000 and 
111, the following two matrices are obtained: 


2 0 1 0 3 0 
Ds? = 0 2 0 and p; = 3 0 0 
1 0 0 0 0 0 


The above three matrices form the second system of parameters for the gen- 
eral method of analysis described by Rao [255]. 


The two systems of parameters for the experiment described in example 


IX-3 are 
1=12 b=6, r-3 Ё=6; 
20, m=2; №= 2, т = 4; 
№ = 1, n = 4; № = 3, n; = 1. 
О ОО 002 0 
6.5 039 а i ry ОЕА 
рўе ОЕ Lec nans 
1 hoe 230 2 о @ 0 0 
"mart son ero WF 0 
а Ы о ой 
Bu d oes OUR ERU RENS 0 0 4 0 
p О ООО 


Both of the above designs are partially balanced incomplete block designs. 
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IX-4.5 RECOVERY OF INTERBLOCK INFORMATION IN PARTIAL CON- 
FOUNDING - 

In general, it is not recommended that interblock information in incom- 
plete block designs be recovered unless the number of degrees of freedom asso- 
ciated with E,' is greater than 12 to 14. If the degrees of freedom for Е are 
greater than 12 to 14, additional information may be obtained from this source. 
"There are two reasons for the above statement: (i) estimates of variances with 
less than 12 to 14 degrees of freedom are subject to relatively large errors of 
estimation, and (ii) the number of degrees of freedom for a variance composed 
of a linear combination of Е and E, is equal to some value between the de- 
grees of freedom associated with Е, and with E.. Five per cent significance 
levels change rather slowly when the degrees of freedom are larger than 14. 
Also, if the population parameters of Е, and E, are equal, the combined 
variance has degrees of freedom equal to the sum of the individual degrees 

of freedom. 
The analysis of variance for the design of example ІХ -2 recovering inter- 
block (between incomplete block) information is of the form, 


Source of variation df ms Expected value of ms 
Replicate 3 T € 
Treatment (ignoring block) 7 — e" 

Block (eliminating treatment) 4 EV сё dq dog? 
Residual or intrablock 17 E oè 
Total (corrected for the mean) 31 — — 


The expectation of Е, is equal to c? + (r — 1)kes?/r for incomplete block 
designs in which an effect is confounded in one of the r replicates and uncon- 
founded in the other г — 1 replicates with incomplete blocks of size Ё. The 
interactions are evaluated with variance o2 + 4c? in the replicates in which 
they are confounded and with variance c2 in the replicates in which they are 
unconfounded with incomplete block differences. The relative amounts of 
information are the reciprocal of the variances; they are estimated by w 
= (r — J)/(rE/ — E.) = interblock information and by w = 1/E. = intra- 
block information. If the levels of an effect, say (ВС), are weighted by the 
amount of information, then the weighted level of an effect, such as (BC)#+™ 
is obtained from the formula, 


4{w'(BC);,, in replicate I + w(BC);,, in oth i 
ith p l ЕЕ С); ліп other replicates] - (BO): (IX-14) 
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The other weighted effects are obtained from similar formulae. The adjusted 
treatment totals are obtained from the formula, 


Xa = ЦА) + (B); + (С), + (AB ins + (AC Dien 
+ (BC") 44 + (АВО) ад» — 3X....]: (IX-15) 


The double-prime values are not the same as the single-prime values obtained 


Without recovery of interblock information. 1 
The average standard error of a mean difference between two adjusted 


treatment means is 
7 1 Si IX-16) 
за = A; | sacha ivi 


The average effective error variance is 


3 
sr mM 


In the design of section IX-4.1, three replicates might have been selecti 
instead of four; all two-factor interactions could have been confounded w 
the incomplete block differences. The selection of replicates II, III, and I 
would result in such an arrangement. This design, like the design with four 
replicates, is partially balanced, since all two-factor interactions are con- 
founded an equal number of times. Likewise, five replicates may be selected 
80 that ABC is confounded in replicates I and V, and AB, AC, and BC are 
confounded with block differences in replicates IV, II, and III, respectively. 
Such a design would be unbalanced. 

Tf, on the other hand, full information is desired on the interaction AB 
and the main effects in an experiment with four replicates, ABC could be con- 
founded with block differences in replicates I and IV, AC in replicate II, and 

C in replicate III. This is an unbalanced design. The breakdown of the 
total degrees of freedom, neglecting interblock information, is 

MR M MAD EE YS ou o o. 
Source of variation df 
DNE I ein ДУНЕИ S 
Replicate 3 
Blocks within replicates (ignoring treatment) 4 E» 

Main effects from all replicates 3 

4 

1 


Interactions (from unconfounded replicates) 
Error : 


Total 31 


18 
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and making use of interblock information is 


Source of variation df ms Expectation of ms 
Replicate 
Block (eliminating treatment) 4 Ey с? + Зов? 
component (а) 1 oe + 405? 
component (b) 1 а + 20р? 
component (с) 2 o + Зод? 
Treatment (ignoring block) 7 
Error 17 E, сл 
Total 31 


where component (a) is the sum of squares for the interaction of levels of 
effect ABC with replicates I and IV; component (b) is the sum of squares for 
the comparison of the levels of effect ABC in replicates I and IV with the 
unconfounded levels in replicates II and III; component (c) is the sum of 
squares for the comparison of the levels of effects AC and BC in the replicate 
in which the effect is confounded with block differences with the corresponding 
levels of the effect in the replicates in which they are unconfounded with in- 
complete block differences. 

The efficiency [see formulae (IX-6) and (IX-7)] of the above incomplete 
block design relative to the randomized complete block design, assuming the 


confounded interactions are negligible, is 
4E, + (Т + 17)Е, _ (E + 6E) 


(4 4- 7 + 17)Е. TE, ^C 
and not assuming the confounded effects negligible, is 


A((4E — E)/3] + (7 + 17)Е, _ AEV + YE, 
(4 4 7 + 17)Е. 21E. 


The above formulae do not consider the information on interblock compari- 
sons, in which case the efficiency is 


a AE) 1 _2 # axe 
(4E + 1TE)/21/ = ucam $4 w7w af (IX-18) 
o ager T NR 
where ш = 1/E, and w' = 1E; — E, 


IX-5 Fractional Replication 


If the number of treatments of a p^ factorial becomes quite large, а single 
replicate may involve too many experimental units for the available resources. 
In order to get around this difficulty, one may use a fraction of a complete 
replicate. The particular fraction of treatments of a р" factorial to use has Беш 
discussed by Finney [118, 121] and Kempthorne [173] under the topic 9 
fractional replication and by Plackett and Burman [249] under the topic 0 


$ IX-5] . Fractional Replication 269 


optimum multifactorial experiments. Kempthorne [173] showed that the designs . 
suggested by Plackett and Burman are fractionally replicated designs. In 
order to make use of the designs, certain high-order interactions must be 
negligible. Also, fractional replication in factorial experiments is likely to be 
most useful for the comparison of a large number of factors. For example, the 
experimenter might wish to use seven factors each at three levels in all com- 
binations. By using a one-ninth replicate of 37 = 2187 treatments, only 243 
experimental units are required. Two hundred forty-three treatments out of 
the 2187 may be so selected that information is obtained on all main effects 
and two-factor interactions; also, a design of nine incomplete blocks of twenty- 
seven treatments, each using only a single replicate, may be utilized.! Such a 
Scheme is in the realm of practicability. 

The procedure for constructing a fractional replication is closely related 
to the concept of confounding. In the 2^ series the fractional replicate is in 
fractions of 34° for s < n. Thus, a one-half replicate of a 23 involves four 
treatments, and a one-fourth replicate involves only two treatments. Likewise, 
an the 3^ series the fractional replicate consists of fractions of 14° treatments. 
Finney [118, 121] sets up a defining contrast and the aliases. In the 2" system 
the defining contrast consists of the treatments associated with the + signs in 
the highest-order interaction. The aliases consist of setting up equalities of 
effects. To illustrate, consider the 2° factorial. The defining contrast divides 
the eight treatments into two halves, the treatments with + values being 
100, 010, 001, and 111. The A effect is determined from the relation 111 + 100 
= 010 — 001, which is the same contrast as obtained for the BC contrast. 
Therefore, BC is an alias of A. The aliases for a one-half replicate of a 23 
factorial are 


Effect Alias . 
А BC = А хАВС=А?В (mod 2) 
B AC = B XABC = АВ" 2 
С АВ = С хАВС = ABC: E 
I ABC = ABC X ABC = А?В:С+ Г 


In a 2 factorial the half replicate would consist of the treatments 0000, 
1100, 1010, 1001, 0110, 0101, 0011, and 1111, which are the treatments making 


up (ABCD). The effects and aliases are 


Effect Alias 
A A X ABCD = BCD (mod 2) 
B B х ABCD = ACD E 
C CXABCD-ABD “ 
D D x ABCD - ABC o 
AB AB x:ABCD = CD S 
AC AC X ABCD = BD S 
AD AD X ABCD = BC s 
I ABCD X ABCD =I у: 


"See citation to problems. 
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The sum of squares for the comparison of the unthreshed untreated = b; mean with 
the mean of b; and b; is 
[2(334) — 481 — 274) 
24(4+1+ 1) 


and for the comparison of the two unthreshed seed collections in different years 
treated with sodium hypochlorite is 
(481 — 274)? 
24(1 + 1) 


Another comparison? of interest was the one for the treated and untreated samples 
from 1943 collected seed; thus: 
(334 — 274)? 
24(1 + 1) 


The interaction of varieties and seed treatments is larger, F = 124.77/24.23 

= 5.15 > Fon than is expected under the hypothesis of no interaction. The variety X 
treatment sum of squares may be partitioned into three sets of 7 degrees of freedom, 
corresponding to the interaction of the three contrasts with varieties. All are significant 
at the 5 per cent level, even though the contrast of threshed with unthreshed accounts 
for a large proportion of the interaction sum of squares. The calculation of these sums 
of squares is left as an exercise for the reader [273, Ch. 15]. 

| The correct error term for testing differences between variety means is error (a) 
if it is assumed that these particular seed treatmenis are not a random sample of seed 
treatments but are the only ones of interest.? If the seed treatments are considered to be 
a random sample, no appropriate error, as such, exists. The F test of the hypothesis 
of no difference among varieties in the germination of 1200 seeds with the four seed 
treatments by, 01, b», and з із F = 109.02/98.38 = 1.11, which is near the mean value of 
F[E(F) = error df/(error df — 2) = 14/12]. The replicate mean square is small but 
not significantly so, F = 98.38/19.29 = 5.10, which is less than the tabulated 
F( = 19.42) value at the 10 per cent level of probability for a two-tailed test. 

The choice of the experimental error variance for comparing the variation among 
mean germination for the four seed treatments depends upon the hypothesis being 
tested [126, sec. 65; Chapter V]. Zf the eight varieties conslitule a random sample from a 
population of guayule varieties and if а single seed treatment is being recommended for 
all guayule varieties, the A X B interaction mean square is the appropriate error 
for making an F test; thus, F = 10,258.09/124.77 = 82.2 > Fo = 4.87. If, on the 
other hand, these eight guayule varieties represent the only varieties of inlerest and if 
the recommended seed treatment is for the eight varieties, then the error (6) mean 
square (24.23) is the correct error for comparing the seed-treatment means. In either 
case the interpretation is the same for this example. р 

A third situation may arise in which it is assumed that the eight varieties consti- 
lute a random sample of guayule varieiies, but a different seed treatment will be recom 
mended for each variety, since there is known to be a variety by seed-treatment inter- 


1This comparison is not independent of the above three comparisons. i 

*If one is interested in testing variety differences, not solely differences among CE 
the error (а) mean square is the appropriate mean square for either the finite or the in 
model. 


= 52.56, 


= 892.69. 


= 75.00. 
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action. In this case the correct experimental error mean square for a particular mean 
T.i; is the error (b) mean square; the standard error of ži; is error (8) msgr- 

Standard errors of a mean difference are available [60, 219, 287] for the several 
comparisons that may be made among the 4 X 8 = 32 seed-treatment and variety 
combinations. The standard error of a difference between two variety means on а 
split plot basis is 


__ БЕ, _ [98.38) _ l 
ч = ATE Bay sd = “05 (Х-2) 


where E, = error (а) and Ep = error (b) mean squares. 
The standard error of a difference between two seed-treatment m on a single 
plot basis for the first hypothesis cited above is 


ЛЛ 
_ _ , Pinteraction ms) _ [PUAT Sy e o 
ES pr 8(3) f X i b ag 


ч 
and for the second case given above is Š | E \ $ 
BE [pum CU 
чт Мазар MEN 750 


The standard error of a difference between two b (see 
level of the factor a (varieties) is 


PENEAN PEED ctun 
г 3 


The standard error of a difference between two a (variety) means at the same 
level of the factor b (the same seed treatment) is 


n JE ZDE, +E] [39:23 +9838] _ 5.34. (х-б) 
rq 3(4) 

The degrees of freedom for the standard errors of a mean difference given by 
formulae (X-2) to (X-5) are those associated with the particular mean square used. 
However, the number of degrees of freedom associated with sq from formula (X-6) 
must be approximated [55, 287]. 

The above standard errors are appropriate if the whole plot treatments have been 
randomized within the complete blocks. If a systematic arrangement of the varieties 
had been used, then only the second, third, and fourth standard errors (formulae 
(X-3) to (X-5)) of a mean difference would be applicable. The first and fifth standard 
errors would not be applicable for a systematic arrangement of the varieties. Because 
of these complications, systematic arrangements of the whole plot treatments are to 
be avoided. 

The efficiency of this split plot design on the B and A X B comparisons relative 
to what it would be for the thirty-two combinations allotted at random to the thirty- 
two plots in each replicate is estimated to be 

(7 + 14)(98.38) + (3 + 21 + 48)(24.23) _ 40.97 _ убо mercer’ 
93 (24.23) 24.23 
and for the A comparisons is 40.97/98.38 = 42 per cent. On the former comparisons 
а gain in precision of 69 per cent was obtained, while for the latter comparisons, the 


нн means UY 
wus * 7 


(Х-5) 
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variety comparisons, this design was less than half as efficient as the randomized 
complete block design. 


X-1.4.2 Whole plots in a latin square design. In some instances 
depending upon number of replicates and the experimental conditions, the 
whole plots may be arranged in a latin square design. By so doing the compari- 
sons on the A effects or whole plot contrasts may be more precise than if the 
pq combinations of the factors a and b had been laid out in a randomized 
complete block design, since the latin square design is often more efficient than 
the randomized complete block design [201, 319]. The breakdown of the total 
degrees of freedom in the analysis of variance is 


Source of variation df ms 
Row p-l Er 
Column p-l Е, 
А = whole plot treatments p-l 
Error (a) (p ~ 1)(p — 2) E, 
B = sub-plot treatments 9—1 
АХВ (р – 1)(9 — 1) 

Еггог (Ь) p(p — 1)(9 – 1) E, 
Total 9р? — 1 


А —————————____ 


For the comparisons among whole plot treatments the efficiency of this design 
relative to a randomized complete block is estimated by 


Ег E (E. + (p — 1)Е,)(р — 1)/р + plq - DEs Р 
a (pq — DE, > (X-7) 


From experimental evidence on field crops [201, 319], it has been found 
that comparisons of whole plots arranged in a latin square are almost as precise 
as if a randomized complete block design had been used for the pg treatments. 
Thus, little information may be lost on whole plot comparisons and con- 
siderable gains in information may be made on sub-plot comparisons by design- 
ing the whole plots in a latin square. This feature adds considerably to the 
attractiveness of split plot designs. 


X-14.3 Whole plots and split plots arranged in latin square 
designs. Another variation of the split plot design is to arrange the treat- 
ments so that each split plot treatment appears once in each order in each 
г eplicate for a particular main plot treatment. For example, the illustrative 
er М Х-1 could be designed аз in table X-4, where the split plot 
fenis of rd all orders in the three replicates. Four of the twelve arrange- 
the four t 3 latin square are selected at random and laid out in each 0 

т treatments ao, а, аз, and аз. In such a design the ordinary error 


sum of squares is partitioned i i ithi Е 
Я into the two : dox 
and the residual variation. rw 


§ X-1.4.3] Whole Plots and Split Plots in a Latin Square 281 


If the split plot treatments are equal to the whole plot treatments in 
number and are arranged to occupy all orders within whole plot treatments 
for a particular replicate, the sum of squares for orders also represents one 
component of the А X B interaction. Such an arrangement is undesirable if 
the A X B interaction component is not equal to zero. If the whole by split 
plot interaction is nonexistent, it may be desirable to confound one of the 


TABLE Х-4. Field lay-out for a split plot design with the whole plots in a randomized 
complete block arrangement and the split plots in a latin square arrange- 
ment within whole plot treatments 

Replicate III 


аз LE] 


Replicate I Replicate Ii 


Analysis of variance 


Source of variation 
Replicate 

A = whole plot treatment 
Error (a) 


B = sub-plot treatment 
AxB 
Orders within each whole plot 


Residual = error (b) 


Total 35 


сососую aun |8 


components of the A X B interaction with orders within whole plot treat- 


ments for each replicate to control this source of variation. 
For p = q, a variant of the design of section X-1.4.2 is to arrange the whole 


plot treatments in a p X.p latin square and to arrange the splitplot treatments 
in orders within each treatment. The form of the analysis for whole plots is 
the same as described in section X-1.4.2, and for the sub-plots as given in 


table X-4. 


In designing the whole or split plots in latin squares, account should be 


taken of the number of degrees of freedom associated with the error (a) and 
error (b) mean squares. For small values of p the decrease in the error mean 
Squares may be offset by the loss in the associated degrees of freedom. For 
the design described in table X-4, only 8 degrees of freedom are available for . 
the residual mean square, whereas 16 degrees of freedom are available if 
Orders are ignored. For a number of experiments the decrease in the error mean 
Square may be more than offset by the loss in degrees of freedom, resulting 


ìn less precise comparisons. 
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Х-1.4.4 Repetitions of experimental results. A particular type of 
experimentation involves repetition of the treatments in a design (for ex- 
ample, the randomized complete block, the latin square, or other designs). 
over several intervals of time, over several locations, or over several groups 
of material. It might be argued that these designs are not true split plot 
designs, but since the design and analytical features are of the same nature 
as split plot designs, there is little reason to set up a separate category for 
these designs. 

Warriner [303] conducted an experiment on the readability of fifteen high 
school physics textbooks. Sample passages were selected from ten comparable 
. parts of each book and three students were assigned to take tests on a given 
part of every book. In all, thirty students participated in the test. The key- 
out of the degrees of freedom for this design is 


Je 

Source of variation df ms 
— NNNM 

Part of book 9 — 

Among students within parts 20 E, 

Text 14 — 

Text X part of book 126 — 

Text X student within parts 280 E, 
ia. ———— КДЫН 

Total 449 — 


The mean square for parts of the book are compared with Z,. The mean 
squares for texts and the text by part of book interaction are each compared 
with Es, since these texts and parts of text represent the only items of interest. 

А design similar to that described above is the randomized complete block 
experiment repeated at several locations. The numbering of the replicates at 
the various locations is purely arbitrary, just as was the numbering of the 
students in the above experiment, If location, treatment, and replicate are 
substituted for part of book, textbook, and student, respectively, the key-out 
of the degrees of freedom for a randomized complete block design laid out at 
several locations has the same analysis of variance as given above (see ex- 
ample X-2). The tests of significance are the same if a treatment is to be rec- 
ommended for each location. If the locations are considered as a sample of 
locations, the location X treatment interaction is used as the error mean 
Square (see table X-16). 

Similar analyses are suitable for annual crops repeated over several time 
periods (for example, years) at one location or at one laboratory. However, for 
i aiu Crops repeated over several years, for several harvests of one croP 
üt ay oe Hs ш scorings by judges, the analysis is dail 
follow o ecd actorial, but the tests of significance are different. 

egrees of freedom in the analysis of variance for y years 
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= A effect and £ treatments = B effect in r replicates of a randomized com- 
plete block of a perennial crop is 


Source of variation df ms 
Replicate r-l — 
Year = A y-1 = 
A X replicate (г – 1)(у – 1) Е, 
Treatment = B t—1 — 
B X replicate (т — 1)(t — 1) E, 
AXB Yy-)Dt-) => 
A X B X replicate (r — D(y — I(t — 1) Е, 


Total ryt —1 — 
a OQ cá MEM 


The appropriate error mean square for a particular comparison depends upon 
the nature of the experimental material and the hypothesis tested [47, 273, 
279, 291]. One way of looking at the above experiment is to consider that the 
experimental unit is represented by the total yield obtained in the y years. 
The analysis then reduces to that for an ordinary randomized complete block 
design. If, on the other hand, the above form of the analysis of variance is 
desired and if the years are considered to be the only ones of interest, then 
E, is the appropriate mean square for testing the treatment mean square. 
Likewise, for these particular treatments, E, is appropriate for testing the А 
mean square, even though only one sample is available on any one year. For 
the infinite model, no appropriate error mean square is available for testing 
the 4 and B mean squares; if desired, it is possible to construct a synthetic 
mean square, and to compute the approximate number of degrees of freedom 
associated with it [55, 287]. | 

If a single experiment is harvested several times in a single season or if 
Several readings are taken by a judge or scorer, the above discussion on per- 
ennial crops applies here as well. ч З 

In the analyses of experiments the question arises concerning the inde- 
Pendence of the results obtained from observing a particular experimental 
Unit several times, For example, in a forage experiment a high yield of the 
experimental unit in the first cutting may result in a lower yield, relatively, 
for the sécond cutting because more available soil resources were utilized 
during the first growing period, leaving less for the plants during the second 
Period. Correlation of subsequent observations affects tests of significance in 
the analysis of variance. One method for investigating the independence of 
results from experiments of this nature is to use some form of multivariate 
analysis [279, 291]. However, if total yields over the period are used, the 
Correlation from cutting to cutting (competition in the broad sense) may be 
Ignored. From data on forage crops (grasses) there is some evidence to indicate 


that the harvest-to-harvest correlation within one year may be appreciable 


19 
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experiments, 

Cochran [47], Cochran and Cox [60], Lowe [197], Steel [279], Tukey 
[291], Wishart and Hines [315], and Yates and Cochran [332] present dis- 
cussions on analyses for several repetitions of an experiment, The paper by 
Yates and Cochran is especially enlightening. 


Ezample Х-9; Twelve districts of approximately equal size are set up in Iowa 
for the purpose of conducting corn yield trials [111]. Yield tests on corn are conducted 


differ from region to region. The entries from Year to year seldom remain the same, 
although a few are tested more than one year. 

In Districts I and II, six double-cross corn hybrids were tested in 1942 and in 
1943. The yield data for 1942 are given in table X-5, and for 1943 in problem X-1. 
The trials were grown on a farm (supposedly selected at random) within each district, 
the farm being designated as a location. The design was a randomized complete block 
with four replicates in each district each year. The plot size was 2 X 10 hills of corn 


plot. The replicates were numbered І, IT, ТЇЇ, and IV in District I and V, VI, VII, and 
VIII in District II ; replicate I in District I has nothing in common with the first 
replicate in District IT. The analysis of variance for each of the districts and for the 


27,027.88 + 33,870.11 — 60,705.19 — 192.80 — (805.4 — 901.6)2/48, 
with а single degree of freedom, The total sum of Squares corrected for the mean is 
27,172.12 + 34,442.80 — 60,705.19 = 909.73. 


The variety x replicate within location or Place sum of squares is the sum of the sums 
9! Squares for the individual analyses, 56.27 + 287.33 = 343.60, with 15 + 15 = 30 


anes of freedom. The replicates within location sum of squares is the sum of the 
Picate sum of Squares for the Separate analyses, 


Th т TE P EH with 5 4 a АШАДЫ 
е variety within locatio; at 
же d eu Ee x Squares, 70.36 4- 240.65 — 311.01, contains 


: cation sums of squares, The former is equal t° 
263.6? + 309.52 4+... + 284.52 
TM 309.57 4 i 


8g o — = 60,705.19 = 191.51, 


Чзәш лоу u0]4322110) 


Suo; 


39207 оүцатл ѕәзвәүтаәх х praqky 
Чоузв2от X ppiqfg 

Praqky 

SUOTZBOOT џүцұүл sojwop[doy 
UOT49201 


9U6t «сг 


512403 
5боләәтапот 
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and the latter is 311.01 — 191.51 — 119.50; each sum of squares has 5 degrees of 
freedom. The sum of the above sums of squares should add to the total sum of squares 
within rounding errors. 
'Tests of significance may be made in the manner discussed previously. If it is 
assumed that the fields on which the six hybrids were planted represent a random 
sample of fields or locations, and if it is desired to recommend one hybrid for the area 
comprised by Districts I and II, the interaction mean square, 23.90, is an unbiased 
estimate of the experimental error for comparing the differences among hybrid means. 
The F test, F = 38.30/23.90 = 1.60, indicates little evidence for rejecting the null 
hypothesis. However, further inquiry into the data indicates that the relatively large, 
though not significant, interaction mean square is mainly due to the relative difference 
in yields for hybrid 1-1 at the two locations (figure X-1). Hybrid 2-2 was highest at 
both locations, and hybrid 8-38 was second highest at both locations. 


District II 


& 8 б 


150 
145 
2 
140 ya 
^7] District! 


© 
© 


© 
© 


Total yields from four replicates (Ibs. 


x] 
© 


120 
15-43 4-3 7-39 1-1 8-38 2-2 
Hybrid numbers 
Figure X-1. Graphical representation of yields of six 
hybrids in two districts 


The F test of the error variances at the two places, F = 19.16/3.75 = 5.11 > 
Fos = 2.86, indicates heterogeneity of the error variances. Despite this, the pooled 
error may be considered the best estimate of the local experimental error, since the 
coefficients of variation, 


244/3.75 
us — 6 per cent for District I 


244/19.16 
^ 90Lg = 12 per cent for District П, 


and 
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are expected to be about. 8 to 12 per cent for corn yield trials in Iowa, and District I 
appears to have a smaller error mean square than would usually be expected. Unless 
some explanation is available as to the reason for the difference in the two error vari- 
ances, the experimenter may justifiably regard the pooled-error mean square, 11.45, 
as the appropriate one to use in making tests of significance. The analysis of data 
assuming unequal variances has been discussed to some extent in previous chapters 
[179; 273, p. 83; 279]. 

In making recommendations for the area for which a random sample of two fields 
is available the standard error of a difference between two hybrid totals is 


sa = VAD O5.90) = 19.56, 
and for making recommendations for each place separately the standard error is 
зз = VEO) ULES) = 13.54. 


Х-1.5 MISSING DATA 

In the event that a whole plot yield is missing, the appropriate formula 
for estimating the yield is determined by the design used for the whole plots. 
Tf a randomized complete block design is used, the appropriate formula is 
(V-5). If the whole plots are designed in a latin square, formula (VI-20) is 
the appropriate one. The missing whole plot is estimated, and the whole plot 
analysis is completed as prescribed by the particular whole plot design. The 
sub-plot analysis for a design of the type given in table X-1 is obtained by 
the method of fitting constants [273, Ch. 11; 318a]. 

The formula for estimating the yield of a missing sub-plot unit for the 
design in table X-1 has been developed by Anderson [3] and Khargonkar [181]. 
If the total yield of the whole plot with the missing sub-plot is designated as 
Xain the total yield of the ijth treatment аз X... and the total yield for the 
ith whole plot treatment as X.;., from an experiment of r replicates, p whole 
plot treatments, and g sub-plot treatments, the yield of a missing sub-plot 
may be estimated from the formula. 

гХы. + qXaj Kai, -8 
Kei! = ү —1) SERE 


The estimated value is inserted in the table of yields, and the analysis is com- 
Pleted in the same manner as for no missing yields, except that 1 degree of 
freedom is subtracted from the error (b) degrees of freedom for each value 
estimated, If two or more sub-plot units are missing, one may use the iterative’ 
Procedure described in Chapters V and VI, or one may derive the exact 
formulae for estimating the particular combination of missing values obtained 
In the experiment. 

Anderson [3] discusses the resulting biases in the mean squares; all mean 
Squares except error (b) are slightly overestimated [3; 60; p. 229]. Also, these 
authors [3, 60] present standard errors for particular treatment contrasts. 

Tf several missing observations occur in designs with three-way and higher 
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classifications, reference may be made to the works of Hazel [152], Henderson 


[156], Krishna Iyer [190], Quenouille [251], Stevens [281], Wilks [307], and 
Yates [318, 318a]. Р 


X-2 Тһе Split Block Design 
X-2.1 INTRODUCTION 


In some cases the treatments a; and the treatments b; may be of relatively 
little interest compared to the A X B interaction, In other situations, it may 
be difficult or impossible to carry out the experiment in the manner illustrated 
in table Х-1. Examples of the latter situation are fertilizer and tillage treat- 
ments with ordinary farm implements; spraying treatments on experiments in 
randomized complete block: or latin square designs in the field; etc. To cope 
with situations such as those described above a design variously denoted as 
split blocks [193, 318], two-way whole plots, strip trials [181, 219], or subunits 
in strips [60] has been proposed. In this design the treatments a; are laid out 
in a randomized complete block, latin square [297, 318], or other design, and 
then the treatments b; are laid out across all of the a treatments in one repli- 
cate (table X-7). Such a design would effectively have the a treatments and 


the b treatments designed' as whole plots; the only sub-plot information is 
on the A X B interaction. 


X-2.2 ADVANTAGES AND DISADVANTAGES 
The advantages of the design are 
(i) The subunits may be kept relatively small, even though the whole plots for 


both factors must be relatively large experimental units. 
(ii) Precise information is obtained on the A X B interaction. 


The disadvantages of the design are: 

(i) Information on the A and B effects is less than is available from the ordinary 
randomized complete block design. 

(ii) The analysis of the design is more complex than the ordinary randomized 


complete block design. This is especially true for nonorthogonal situations and 
for covariance analyses. 


X-2.3 RANDOMIZATION AND EXPERIMENTAL LAYOUTS 


The randomization procedure for the a treatments follows that for the 
particular design in question. The randomization procedure for the b treat- 
ments follows that for the selected design for these treatments. Two examples 
of the experimental layout are given in table X-7. 

X-24 ANALYSIS 

4 Two field plans and the breakdown of the degrees of freedom in the analysis 

i E (0 еасһ design are given in tables X-7(a) and X-7(b). The first 
ustrates the design and analysis for an experiment in which the main 
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TABLE X-7. Field designs and analyses of variance for a split block experiment 
with three levels of factor b, four levels of factor a, and three replicates 


(a) Factors a and b in randomized complete block designs 


Replicate I Replicate II Replicate III 

а, а а % зу э, ау 8 в 8 а; а 
bo b. by 
% br vo 
E vo *o 


Analysis of variance 


Source of variation ar ns 
Replicate 2 ” 
А 3 - 
Replicate x A » error (a) 6 EA 
B 2 E 
Replicate x B » error (b) 4 E, 

6 - 
Error (c) 12 E. 
Total 25 


(b) Factor a in a randomized complete block and factor b in a latin square design 
Replicate III * 


Replicate I Replicate II, 
кз uk Qu 8 ао Gy US C QA Sp 
b, 
2 by *o 
o bo bi 
с *o ba 
Analysis of variance 
Source of variation af ns 
Replicate 2 - 
A 5 - 
Replicate x А = error (a) 6 E. 
B 2 c 
Order within replicates 2 c 
Error (b) 2 E 
6 - 
AxB 
Error (c) 12 Е. 
Total 35 


effects are in randomized complete block designs with each of the levels of 
9ne factor running across all levels of the second factor. The comparisons 
among levels of both the a and b factors are made on three replicates and three 
Teplicates only. The standard error of a difference between two means on a 


Plot basis for the a; levels is 
NDS (X-9) 
rq 0x 
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classifications, reference may be made to the works of Hazel [152], Henderson 


[156], Krishna Iyer [190], Quenouille [251], Stevens [281], Wilks [307], and 
Yates [318, 318a]. 


X-2 The Split Block Design 
X-21 INTRODUCTION 


In some cases the treatments a; and the treatments b; may be of relatively 
little interest compared to the A X B interaction, In other situations, it may 
be difficult or impossible to carry out the experiment in the manner illustrated 
in table X-1. Examples of the latter situation are fertilizer and tillage treat- 
ments with ordinary farm implements; spraying treatments on experiments in 
randomized complete block: or latin square designs in the field; etc. To cope 
with situations such as those described above a design variously denoted as 
split blocks [193, 318], two-way whole plots, strip trials [181, 219], or subunits 
in strips [60] has been proposed. In this design the treatments a; are laid out 
in a randomized complete block, latin square [297, 318], or other design, and 
then the treatments b; are laid out across all of the a treatments in one repli- 
cate (table X-7). Such a design would effectively have the a treatments and 


the b treatments designed’ as whole plots; the only sub-plot information is 
on the A X B interaction. 


X-2.2 ADVANTAGES AND DISADVANTAGES 
The advantages of the design are 


(i) The subunits may be kept relatively small, even though the whole plots for 
both factors must be relatively large experimental units. 
(ii) Precise information is obtained on the А X B interaction. 


The disadvantages of the design are: 

(i) Information on the А and B effects is less than is available from the ordinary 
randomized complete block design. 

(ii) The analysis of the design is more complex than the ordinary randomized 


complete block design. This is especially true for nonorthogonal situations and 
for covariance analyses. 


X-2.3 RANDOMIZATION AND EXPERIMENTAL LAYOUTS 


The randomization procedure for the a treatments follows that for the 
particular design in question. The randomization procedure for the b treat- 
ments follows that for the selected design for these treatments. Two examples 
of the experimental layout are given in table X-7. 

X-24 ANALYSIS 

d Шо? iin and the breakdown of the degrees of freedom in the analysis 

E iC id each design are given in tables X-7(a) and X-7(b). The first 
Шиз{га{ез the design and analysis for an experiment in which the main 
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TABLE X-7. Field designs and analyses of variance for a split block experiment 
with three levels of factor b, four levels of factor a, and three replicates 


(a) Factors a and b in randomized complete block designs 


Replicate I Replicate II Replicate III 
"э ux tome 210080: 625 53 Ga vh Sep ta 
ъ, 
2 bs by 
b, 
о bi by 
b. 
1 % % 
Analysis of variance 
Source of variation of ns 
Replicate 2 - 
A 3 = 
Replicate x A = error (a) 6 E. 
B 2 - 
Replicate x B » error (b) 4 E 
AxB 6 - 
Error (c) 12 E, 
Total 55 


lock and factor b in a latin square design 


(b) Factor a in a randomized complete b 
Replicate III * 


Replicate I Replicate II, 

woe te CON A en за o re 
ba А % 
do b. mh 
эү bo by 


Analysis of variance 


Source of variation af ms 
Replicate : = 
A - 
Replicate x A = error (a) 6 EN 
B 2 = 
Order within replicates 2 - 
Error (b) 2 E 

6 - 
Error (c) 12 E. 
Total 35 


effects are in randomized complete bl 
Опе factor running across all levels o 
among levels of both the a and 
Teplicates only. The standard error © 


Plot basis for the a; levels is 
[2E BE. Q- 
rq ? : 


ock designs with each of the levels of 
f the second factor. The comparisons 
b factors are made on three replicates and three 
f a difference between two means on a 
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and for the b; levels is 2E, _ [2ЕЬ (Х-10) 
Vip ~ Viz" 


where г = number of replicates, g = number of levels of factor b, and p 
— number of levels of factor a. 

In the second design the order of the b treatments within a replicate is 
taken into account. This type of design may be used when there is a gradient 
from one replicate to another and when the number of replicates equals the. 
number of treatments. Since there are only 2 degrees of freedom associated 
with the error sum of squares in a 3 X 3 latin square, it is inadvisable to use 
this design for three replicates. The standard errors of a mean difference are 
obtained by the same formulae given above except that E, and E, are ob- 
tained from table X-7(b). 

The F test of the А, B, and A X B effects employs Ea, Es, and E., re- 
spectively, as the experimental error mean squares. It is expected that the 
error (c) mean square, E., will generally be smaller than E, or E;, since it 
represents intrablock or intra-whole plot variation, while the other two repre- 
sent interblock or inter-whole plot variation. Thus, the A X B interaction 
is estimated more precisely than are the main effects A or B. The increased 
accuracy on the A X B interaction is obtained by sacrificing accuracy on the 
whole plot comparisons. If the pg = 12 treatments of table X-7(a) had been 
completely randomized within each of the replicates, the А and B elfects 
usually would be more accurately estimated and the A X B interaction less 
accurately than with the present design. 

Despite the importance of some main effect comparisons, it may be im- 
practical to design an experiment in any way other than in a split block 
design. In a pasture experiment, eight methods of cultivation and four cutting 
treatments (time and height of cutting) were studied to observe the effect of 
the treatments on forage yields. Regular farm machinery was used in the ex- 
periment, thereby necessitating the use of large plots. Four replicates of ihe 
thirty-two treatment combinations were used. If thirty-two plots had been 
included in each randomized complete block, the entire experiment would have 
been spread over a large area. Therefore it was decided to have smaller plots 
by using a split block design, the first replicate of which follows: 


Methods of cultivation 
2 В 5 li 4 8 T 


Cutting 
treatments 
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The use of smaller plots allows the choice of a more homogeneous area for 
the complete block and requires that less material be weighed and handled. 
Another advantage of this type of design for field experiments is that the mini- 
mum amount of turning of farm machinery is required. 

The disadvantage of the above design is the same as for all split block 
designs, i.e., the main effects A and B are estimated with the least precision 
and the A x B interaction with the most precision. In the above example, 


: there are only four replicates on cutting treatments and on methods of culti- 


vation, 
Nair [219] and Cochran and Cox [60] present standard errors for contrasts 
other than those presented above. 


X-2.5 MISSING DATA 

A missing datum for a subunit in a split block design may be estimated in 
the manner described by Khargonkar [181]. A missing datum for an entire 
whole plot would be estimated by the method appropriate for the particular 
design used. 


X-3 The Split Split Plot Design and 
Further Subdivisions 


ns (ann X q X p factorial 
o an experimenter where 
interest, and c and the 


Several levels of three factors in all combinatio 
arrangement of the factors) may be of interest t 
factor a is of little interest, b is of somewhat more 
interactions with the factor c are of most importance. In such a case the whole 
plots containing the factor а would be split for the b treatments. The split 
plots containing the b treatments would then be further split for the levels of 
factor c. The incomplete block containing the comparison of the levels of 
factor c and the levels of the interactions with c are made as homogeneous as 
Possible, since these comparisons are of most importance. 


The experimental layout and analysis are presented in iable X-8 for а 
1s of the factor а are randomized within 


jn each level of factor а the q = 3 levels 
and then within each level of 6 
the split split plots at random. 
only levels of the factors а, b, 
he A effect mean square із Fa, 


X C effects is Æ.. In general, it is expected tha 
Precision with which the split split P 
than for the other comparisons. The B 
With more precision than are the А comparisons, just as they are in an ordi- 


Nary split plot design. 
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TABLE X-8. Field design and analysis of variance for a split split plot design 
of ngp = 4 X 3 X 2 = 24 combinations of the factors a, b, 
and c in three replicates 


Replicate I Replicate II 


Analysis of variance 


Source of variation аг 
Replicate 
А 


Вер. х А = Еггог (а) 
в 

Ахв 

Error (b) 

с 

Ас 

BxC 

AxBxC 


К) 


н.о e 
Ban aun 


c 
с 
с 
(3 
с 


ео 
ОСО ы 


The standard errors of a difference between two means on a split split plot 
basis are 


for the comparison of two levels of factor a 


J£ (X-11) 


for the comparison of two levels of factor b 


25, 
npr' (X-12) 
for the comparison of two levels of factor b at the same level of factor a 
2E, 
4 5 (X-13) 


for the comparison of two levels of factor a at the same level of factor b 


Fa - DE FE, FEJ K-14) 


qp 
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for the comparison of two levels of factor c 


ARE. (X-15) 
ngr 


for the comparison of two levels of c at the same level of a 


2Е. (X-16) 


гӯ 
for the comparison of two levels of a at the same level of c 


2[(р — DE. + Eo) (X-17) 
rpq 


for the comparison of two levels of c at the same level of b 


ФЕ; (X6) 
rn 


and for the comparison of two levels of b at the same level of c 


ESTEE лө) 
гпр 


Тһе number of degrees of freedom associated with the standard errors given 
in formulae (X-11), (X-12), (K-13), (K-15), (X-16), and (X-18) is the same 
88 for the error mean square used. The remaining standard errors have an 
unknown number of degrees of freedom associated with them, but the number 
of degrees of freedom may be approximated [55, 287]. 
The split split plot design may be necessary because of the nature of the 
experimental material rather than because of the desire of the experimenter 
Ог more precision on some of the comparisons by sacrificing accuracy on 
Others. If this is true, the experimenter may change the plot shape and in- 
Complete block shapes in order to increase the accuracy on the other factors. 
or example, suppose that it is impossible to lay out the levels of a in any other 
than large plots. The experimenter can choose the shape of whole „plots 
(usually long and narrow) and the shape of the replicate (usually square, ла the 
absence of any knowledge of soil variation) so as to have the best comparisons 
Possible on the factor a. Or, the factor b may be of considerable importance; 
ìn this case the whole plot should be as nearly square as possible, with reo- 
‘angular split plots. This gives the best comparisons on the B and A x B 
effects, In the third instance, if the с factor and interactions are of most im- 
Portance, then the split plot should be square or nearly so, with the split 
Split plots being rectangular in shape. From the above considerations, then, 
the experimenter may change the precision on certain comparisons by chang- 
‘ng plot shape, even though the experimental material requires a split split 


Plot design, 
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Further examples of split split plot designs are afforded by experiments 
conducted at two or more locations over a period of years for either an an- 
nual or a perennial crop. Of course, the analyses for annual crops and for 
perennial crops are different, but the concepts are similar. Experiments har- 
vested more than once during the year for two or more years [197] represent 
another example of this type of design [section X-5.3]. 

For some experiments, further subdivision may be desirable either be- 
cause of the additional information on the factor or factors in the smallest 
unit or because of the ease or timeliness of application of the additional fac- 
tors. Suppose that r replicates of n irrigation treatments (I) each with p 
varieties (V) as the sub-plot treatments are available. Suppose that it is 
decided to apply fertilizer treatments to the above design after the experiment 
has been laid out as a split plot design. Suppose further that the fertilizer 
treatments are composed of q levels of nitrogen (N) and s levels of potash (К) 
in all combinations, and that the q X s treatments are applied to the split 

TABLE X-9. Key-out of degrees of freedom for a split split split plot design 


Source of 


Source of 
variation af шз 


variation at mo 
Replicate r-i = Fungicide (ғ) t-l S 
Irrigation (I) n-l - Ix (n-2) (£1) - 
Error (a) (r-1)(n-1) EN VF (p-1)(t-1) - 
Variety (V) pel = IxVxP (1-2) (p-1) (5-1) - 
IxY (2-1)(z-1) - Р (a-1) (5-1) - 
Error (b) n(r-1)(p-1) E, ЮР (5-1) (t-1) - 
Nitrogen (N) а-1 - НхКхг (2-1) (5-1) (6-1) - 
' Potesh (К) s-l = Inter (n-i) (4-1) (4-2) - 
ЫК (9-1) (s-1) - (n-2) (s-1) (t+1) = 
Tx (n-1)(a-1) - (n-1) (a-1) (s-1) (&-1) - 
Ink (n-1)(s-1) - {р-1)(ә-1)(%-1ї) - 
тык (п-1)(а-1)(в-1) = G-1)G-2)6-3) ^ - 
Ух (2-1) (9-1) - {P-1)(q-1)(s-1)(t-1) | - 
Va (p-1) (s-1) = IxVxHxP (2-2) (0-1) (a-1) (6-1) = 
Уак (р-1)(а-1)(в-1) - ІХУХКХР (8-1) (р-2) (2-1) (а) = 
Ыру (n-1)(p31)00(00) = Тху — (n-1)(p-1)(q-1)( s-1)(t31) ~ 
IxVxK (n-1)(p-1)(s-1) - Error (а) npas(t-1)(r-1) E 
IxVxN«K (a-1)(p-1)(e-1)(s-) = rc MER NEN. ~ 
Error (c) np(as-1)(r-1) E Total rnpast = 1 


split plots at random within each split plot. In addition, suppose that each split 


split plot is subdivided further to allow the application of 1 fungicide spraying 
treatments (F); i.e., each split split plot is subdivided into i овна and the 
i fungicides are randomly allotted to the split split split plots within each 
split split plot. The key-out of the degrees of freedom for this design is give? 
in table X-9. In this particular design and for the finite model, “four error 
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terms, Ea, Es, Ee and Ез, are available for comparison with the other mean 
squares. E, is used to test the variation among irrigation treatments; E; is 
used to test the existence of varietal effects and of the J X V interaction; 
E. is compared with the mean squares for nitrogen, for potash, for their inter- 
action, and for the interaction of these fertilizers with varieties and irrigation 
treatments; and Ег may be compared with the mean squares for fungicides 
and for all interactions involving fungicides. For the infinite model, synthetic 
error variances must be constructed to test а number of the mean squares 
[55]. However, for treatments of the nature described here the finite model 
18 usually more appropriate. 

. lf one of the effects, say irrigation, is large compared to the other effects, 
it is possible that all interactions with this effect will be significant, and 
the significance of four- and five-factor interactions usually is difficult to 
explain. Since the components making up the various mean squares probably 
are heterogeneous, a combined analysis assuming homogeneity of errors may 
be undesirable. Because of this а separate analysis is computed for the plots 
from each irrigation treatment. Relatively large four- and five-factor inter- 
actions may merely indicate that it is incorrect to generalize over a wide range 
of certain treatments. This is especially true in experiments where one of the 
factors, such as water, exerts a dominating influence. 


X-4 Some Variations 


Some modifications of the split plot design are illustrated in problems 
-9 and X-8. The number of replicates at the various locations is unequal for 
the experiments in both problems, but this feature does not complicate the 
analysis unduly. As long as the number of replicates for the treatments or 
Varieties remains the same at each location, the feature of proportionality is 
not affected. Problems X-7 and X-9 present a more complicated variation of 
the split plot and the split block designs, respectively. In these cases the 
latin Square arrangements are used for both the whole plot treatments and 
the split plot treatments. Hedayetullah et al. discuss split block designs with 
Split plots [154]. When the split plot treatments represent a factorial arrange- 
ment, they may be put into incomplete blocks within the whole plot. Con- 
founding some split plot comparisons with incomplete blocks may increase 
the accuracy on the whole plot comparisons. Finney [120], Kempthorne [171], 
апа Singh [265] discuss other forms of confounding in split plot designs. і 
п some instances, variation in split plot designs is due to errors in laying 
Out the experiment. The design illustrated in table X-10 was presented as a 
it was found that the "split plot treatments" 


Split plot design, Upon inqui 
Д ашгу, ^ + 
(methods of tirer were laid out in strips across all whole plots (fungi- 


Cides) and that the arrangement of methods of application was systematic 
^n the three replicates. This design is a split block design with one set of whole 


296 Split Plot and Split Block Designs [$ X-4 


TABLE X-10. Example of a split block design with one of the whole 
plot treatments arranged systematically in the three 
replicates (Fungicides = во, аз, аз, ..., 8s; methods of. 
application = bo, bi, bz) 


Replicate I 


а 85 ag ay a7 a 80 86 в: 
1 
2 


Replicate II 


Ce ee Ge 


Ai» M87 


АА ос Moup igs 052 Y "Ape а 
крш БЕ 
br 
be 
P Replicate III 
ay 86 25 OS ag as 8x s7 as 


% 
B 
52 


Analysis of variance 
nce 


Source of variation 
tion 


ar 
Replicate 2 
A = fungicide 8 
Replicate x A = error (a) 16 
B = method 

Replicate x B ] confounded 6 
AxB 16 
AxB x replicate s error (*) 32 


gui ! Qd rg fg 


Total 


plots arranged Systematically in all replicates. Owin, 


g to the error in laying 
out the experiment, the comparison of most interest, 


methods of applications 
ely, as there is no suitable error for testing the B 
effect, fungicides = 
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The split plot design was used effectively by Gowe [138] in a poultry 
breeding experiment to study the length of fertility of sperm in the oviduct 
of two strains of chickens, a; and a;. Two pens representing replicates were 
available. The whole plot treatments were eight cocks, four of the a; strain 
and four of the a strain. The eight cocks were used in both pens, and each 
cock was mated to four dams, two of which were of the a» strain and the other 
two of the a strain, This resulted in a total of thirty-two dams in each pen. 
The breakdown of the total degrees of freedom for the design is given in 
table X-11. The key-out of the total degrees of freedom is different from the 


TABLE X-11. Breakdown of degrees of freedom for Gowe's 
poultry breeding experiment 
Source of variation af 
Whole Plot Analysis _ 
Pens 
Cocks 


Between strains 
Within 80 strain 


Within a, strain ; 5 
Pens x cocks » error (a) 1 Ea 
Split Plot Analysis 


Between strains for dams 
Strains dams x strains cocks 1 
Error (b) wu 


Within Split Plots Analysis 
Between two & dams on same male in 
same pen 16 - 
Between two a, dams on same male in 


same pen 

Total 65 
Ordinary split plot because of the nature of the experimental material. The 
Split plot treatments were not the same for all whole plot treatments. The 
16 degrees of freedom in the split plot analysis are associated with the failure 
of the two samples of two hens of the same strain in the pens mated with the 
same cock to react the same with regard to length of fertility of the sperm 
ìn the oviduct, This results in 8 degrees of freedom for the samples of a) dams 
and 8 for a, dams, After removing the effect due to strains of the dams and 
the interaction of strains dams by strains cocks, there are 14 remaining de- 
ЁТеев of freedom for the error (b) sum of squares, which is regarded as the 
experimental error sum of squares. The within-split plot sums of squares аге 
obtained in the usual manner. The variances among dams and among cocks 
an the a strain may be compared with the corresponding variances for the 
% strain, 

Another variation of the split plot design is introduced when the com- 
Parison among some split or split split plots represents dummy comparisons 
(ie, Comparisons among plots treated alike). Р. С. Homeyer (lecture notes, 
1946, Towa State College) discussed an example involving two methods of 


Ae 


vr 
i] 
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application (co and с), three fertilizers (by, bı, and bə), and four rates of plant- 
ing (do, а, a», and аз) in three replicates. The field design was а conventional 
split split plot design with the rates of planting as the whole plots, the fertilizers 
(equal to none, 100 Ibs., and 200 lbs.) as the split plots, and methods of 
application as the split split plot. Clearly, it is impossible to make two methods 
of application of no fertilizer! The comparison between the two split split 
plots treated alike in each replicate represents intra-split plot variation and 
should be included in the error (c) sum of squares. The breakdown of the total 
degrees of freedom for a design of this type is given in table X-12. In obtain- 


TABLE X-12. Breakdown of degrees of freedom for 
the application-fertilizer-planting ex- 
periment 

Source of variation аг 


Whole Plot Analysis 


Replicate 
Rates of planting = A 
Rates x rep. = error (a) 


a 
Split Plot Analysis 


ns 


aun 
tl 


levels of fertilizers = B 2 = 
AxB 6 - 
Error (v) 16 E, 
Split Split Plot Analysis 
OY Analysis 
Methods of application = C 1 - 
AxC 3 Es 
BxC 1 =, 
AxBxC 3 - 
Error (c) 143424 or 124306 = 28 E. 
Total 71 


ing the B X C interaction sum of squares, only the levels of fertilizer b; and 
b; are used, resulting in an interaction sum of squares with 1 degree of freedom. 
If three levels of b are used, 2 degrees of freedom are available for interaction, 
but one of the 2 degrees of freedom represents a dummy comparison and is 
included in the error (c) sum of squares. Likewise, there are 3 degrees of free- 
dom in the 6 Ax Bx C interaction degrees of freedom which represent 
dummy comparisons. "These 3 are included in error (c). The resulting error 
(c) sum of squares has 28 degrees of freedom instead of the twenty-four in the 
ordinary split Split plot design. The B X C and A X B X C interactions are 
evaluated on two-thirds of the plots rather than on all of the plots, as they 
are in the ordinary split split plot design. 

X-5 Expectation of Ме 
X-5.1 SPLIT PLOT DESIGN 

t M ee шеле for the split plot design composed of p whole ploi 
E E omized complete block design with the q split plo 

om/y allotted to the subunits of each whole plot is 


Хоу = p+ Po + as + 5, 4 В; + (eB); + єн» (X-20) 


ап Squares 
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where и = the over-all mean effect, p; = the gth replicate effect, o; = the 
effect of the ith level of factor а, б; = a random component of error associ- 
ated with the ith whole plot treatment in the gth replicate, 8; — the effect of 
the jth level of factor b, (28):; = the interaction effect of the ith level of factor 
a with the jth level of factor b, and є,:; = a random component of error associ- 
ated with the gijth sub-plot in the gth replicate. 

If the effects are assumed to be random independent variables, the coeffic- 
ients of the various components of variance and of p° are as given in the left 
half of table X-13. The experimenter may be interested only in the particular 
levels of a and b under experimentation. For this situation, coefficients in the 
right half of table X-13 are the appropriate ones for the various components of 
variance, of и?, and of the various sums of squares of effects. The expectation 
of the various mean squares in the split plot design are given in table X-14 for 
both models. These are obtained from the coefficients in table X-13 for the 
appropriate sums of squares. The method used here follows that described in 
Chapter VIII. For example, the expected value of the sum of squares for the 
whole plot treatment X sub-plot treatment interaction for the infinite model 
is 

ЕГУ УУ Х.„у/г — ууХ.га/га — у ,Х..ў/тр + X..2/rpa] 

= {rpq(u? + oa? + og? + таг?) + palo? + ог? + cé)} — {раби + са?) 

+ palo + сз?) + гр(ов + oas) + poc] — (rp? + es) + рас 

+ rq(e.? + cag’) + qlo + o2)] + {гран + pages? + гас? + qes + грав? 

+ rong? + сд) = (p — (a — De? + roas), (X-21) 
with (p — 1)(q — 1) degrees of freedom. 

In the finite case the Уо: = 228; = 22(a8)s = >08) = 0. Тһе ех- 
Pected value of the interaction sum of squares in equation (X-21) now becomes 
(npg? + pq(s 2 + os? + ог) + Уа + PLB? + 12222 (aB)/1 — (пра? 
+ рде 2 + rqS ae + рде + poe} — (тран? + рас + 49 + pr2287 
+ qe] + {грды? + рде è + qo? + cè) 
= (р — Yq — Ioe + TLDs). (X-22) 
with (p — 1)(q — 1) degrees of freedom. 

5-52 RANDOMIZED COMPLETE BLOCK EXPERIMENT 
VERAL LOCATIONS 

For a randomized complete block experiment re 

the yield of a single observation may be expressed as 
Ху = и + Мә + би + оу + (@A)os + ен 
where д = mean effect, A, = effect of gth place or location, б; = a random 
effect associated with the ith replicate at the gth location, a; = treatment 
effect, (oA), = effect of jth treatment at the gth location, and e;; = a ran- 
om component associated with the jth treatment in the ith replicate in the 
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gth location. Assuming that the effects are random and independent variables, 
the coefficients for the various components of variance are as given in the left- 
hand portion of table X-15. If it is assumed that these locations are the only 


ones of interest, then 2A, = J (aà)o; = 0 and the appropriate coefficients 


are those in the right half of table X-15. The expected values for the various 
mean squares are given in table X-16 for both models. 

The appropriate error mean square for treatments is different for the two 
models. For the infinite model the interaction mean square is the appropriate 
error for testing treatments. (This is also the correct error mean square if 
Ше treatments are the only ones of interest.) The error (b) mean square is 
the appropriate one for comparison with the treatment mean square for these 
particular locations; i.e., treatments are to be recommended for each location 
and not for the entire area. 

The above analysis and models are appropriate for other types of experi- 
ments. For example, equation (X-23) is the appropriate linear model for an- 
nual experiments repeated through time. Also, equation (X-23) and the finite 
model are appropriate for artificial insemination and progeny evaluation 
experiments, where p sires are available and these sires are the only ones of 
interest; the dams are the treatments and the progeny records are the subunit 
observations, 

a EXPECTATION OF MEAN SQUARES FROM AN EXPERIMENT ON 
ENNIAL CROPS [197] 

f е layout, yields, Жуу, and linear model for a forage-crop experiment 
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where д = mean effect, 
a, = effect due to gth year, 
Yna = effect of Ath cut, 
(оу), = effect peculiar to hth cut in gth year, 
pi = effect due to ith replicate, 
(ap): = effect due to ith replicate in gth year, 
(yp) a: = effect due to hth cut in ith replicate, 
(аур): = effect due to hth cut in ith replicate in the gth year, 
т; = effect due to jth variety, 
(от) :; = effect due to jth variety in ith replicate, 
(ет); = effect due to jth variety in gth year, 
(ут) a; = effect due to jth variety in hth cut, 
(күт), = effect due to jth variety in hth cut in the gth year, 
(ver) ni; = effect of hth cut on jth variety in the ith replicate, 
(apr) oi; = effect of jth variety in the ith replicate in the gth year, 
му = effect common to jth variety in hth cut in the ith replicate in the 
gth year. 


In the above scheme, only two cuttings, hay and aftermath, are considered. 
The following formulae are developed for с cuttings per year rather than for 
the two in the example. The mean effect и is considered to be a constant. The 
number of cuts in these experiments constitute the whole of the population; 
1.е., there are only c types of individuals in the population of cuttings. There- 


fore, Ey =u Syn = 0 = Уо = 29x = 2s Erè = mh 
h 


à А 
Е(оу) 2 = ay? Е(үр)һ2 = суг, and Elyr)a = esr. Algo, е following 
Summations equa] zero: 


(аур), = 2 eros - 2 ren) му = 0. (Х-25) 


The following expectations are assumed: 

E(eyp)o s = Gays’, E(oyr)o s! = сет, and E(ypr) uf = олн. (X-26) 
The remaining effects are considered to be random independent variables from 
an infinitely large population with zero means and variances peculiar to each 
effect, The average value or the summed values of all cross products equals 
Zero, 
th Since the sample of years represents consecutive years, it might be argued 

at they do not represent, independent variates. Since it is doubtful if the 
Ay endence is of any appreciable size and since an experiment on perennial 

orage crops will never be conducted any other way, the sample of years is 
assumed to be a random sample of years. Cuts, on the other hand, were not 
a arded as independent. Regardless of the desires of the experimenter, there 

n only be hay yields and aftermath yields on some types of forage-crop 


TABLE Х-17. Coefficients of components for expected value of the various sums of squares 
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experiments. Therefore, it would be idle to speculate about the relative 
efficiency of increasing the number of cuttings per year. Two cuts will be avail- 
able each year. It is possible on perennials, but not on biennials, to increase 
the number of years on which yields are taken. However, it is doubtful if 
yields will be obtained for more than four to six years at most. 

The coefficients of the various components are summarized in table X-17. 
"The expectations of the mean squares are presented in table X-18. Since cuts 
represent the whole of the population, the various effects do not affect the 
variance of a treatment, year, or replicate (or any combination of these) 
mean; i.e., there is no sampling error due to obtaining only a sample of cuts 
rather than all of them. Also, as a result of having a finite population of cuts, 


the coefficient c/(c — 1) appears in the expectations of the mean squares in- 
volving cuts. 


CHAPTER XI 


Incomplete Block Designs: General Considerations 
and the One-Restrictional Lattices with 


Treatments in Complete Replicates 


XI-1 Introduction 


The designs discussed in Chapters IV to VI may be unsuitable for experi- 
ments in which a large number of varieties or treatments is used, and those 
given in Chapters IX and X may be unsuitable for experiments in which the 
treatments are not of the factorial type. In response to the need for efficient 
designs for a large number of treatments, Yates [320-8] developed the group 
of incomplete block designs known as quasi-factorials or lattices. Following 

ates’ first discussions on the construction of lattices, Bose and Nair [30] 
and Nair and Rao [226] presented a detailed account of the construction of 
Several incomplete block designs. Numerous other papers [e.g., 48, 60, 71, 
74, 103, 105, 114, 129, 136, 148-50, 175, 177, 178, 222, 227, 255, 259, 266, 305; 
See 113 for other references] have been written on the construction of lattices; 
the general methods for constructing b incomplete blocks of size k for v treat- 
ments in r replicates are: (i) for v of the form р" or k X q" X p” the method 
о confounding in factorial experiments [103, 105, 114, 175, 177, 178, 228, 320, 
322-4, 326, 327); (ii) the use of geometrical configurations [30, 71, 129, 226]; 
(ii) the use of orthogonal latin squares [74, 222, 305, 322]; and (iv) exhaustive 


enumeration. 


Lattice designs may be subdivided into the three categories described in 


hapter IX; i.e., balanced confounding, partially balanced confounding, and 
unbalanced confounding of the contrasts. In constructing lattice designs of 
е balanced type, two fundamental relations are involved: 


die os | (XI-1) 


and : 
AQ — 1) = r(&— 1), 


Th ere b > y and д = the number of times (an integer) а treatment occurs 

My each of the other treatments within an incomplete block. If à is equal 

Ог all pairs of varieties (treatments), the design is balanced. A large number 
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of balanced designs has been worked out and tabulated [26, 30, 32, 60, 71, 129, 
320] for r < 10 and for k of various sizes (table XI-1). Since it is not always 
possible to arrange the incomplete blocks so that all varieties fall within one 
complete block or replicate, table XI-2 has been prepared to illustrate the 
range of values of v and k for which the varieties may be arranged in com- 
plete blocks. It should be noted that there are other values of v and k not listed 
in table XI-2 for which complete block arrangements are possible. Also, it 
would be interesting to know what arrangements are possible for treatments 
appearing two or three times in one complete block. These arrangements have 
not been completely investigated to date [60, 327]; such designs would be use- 
ful in testing for differential effects and for determining the appropriate scale 
of measurement. Since the number of replicates required for balance may be 
large, Yates [320, 322, 324, 321], Bose and Nair [30], and others developed 
the partially balanced incomplete block designs. This class of designs adds 
considerably to the range of values of v and k and allows for relatively small 
values of r (table ХІ-9). Partially balanced incomplete block (p.b.i.b.) designs 
are available or may be constructed for most values of v and k, although the 
analysis may be difficult in some cases. 

As first proposed the analysis for the quasi-factorials or lattices recovered 
only intrablock information; the variance among the incomplete blocks was 
ignored (see Chapter IX). To make use of the additional information available 
in the variance among incomplete blocks, Yates [326, 327] and others [48, 
136, 177, 178, 220, 255] proposed an analysis recovering interblock informa- 
tion. With the recovery of interblock information the efficiency of the lattice 
design (with treatments in complete replicates) relative to the randomized 
complete block design can be only slightly less than 100 per cent, whereas 
the efficiency can be much less than 100 per cent when interblock information 
is ignored. The same is true for lattice designs with treatments not in complete 
replicates relative to the completely randomized design. Hence, the recovery 
of interblock information as proposed by Yates greatly enhances the value 
of incomplete block designs relative to complete block designs, since the 


average efficiency of the former designs to the latter will always be greater 


than 100 per cent with any meaningful stratification or grouping of the 
experimental material into homogeneous subgroups. 


XI-1.1 ADVANTAGES AND DISADVANTAGES 


The chief advantage of lattice designs is that a large number of treat- 

ents may be compared within relatively small blocks, the incomplete blocks. 
Another advantage of lattice designs lies in the fact that they may be analyze 

ete block design 


TABLE XI-1. Balanced incomplete block arrangement with 10 


or less replicates* 


SOLVED 
v b Y k v b r k 
4 6 3 2 15 26 6 5 
4 ы 5 kj 15 5 4 4 
5 10 E 2 15 13 9 9 
5 10 6 3 15 35 T 5 
5 5 4 4 15 15 7 7 
6. © 35 5 2 15 15 8 8 
6 10 5 5 16 20 5 4 
6 15 10 4 16 2 9 6 
6 6 5 5 16 16 6 6 
7 21 6 2 16 16 10 10 
7 T 5 5 19 57 9 5 
7 7 4 4 19 19 9 9 
7 7 6 6 19 19 10 10 
8 28 7 2 21 10 10 5 
8 1» 7 4 21 21 5 5 
8 8 7 7 21 30 10 1 
9 36 8 2 25 50 8 4 
9 12 3 25 50 6 5 
9 18 8 һ 25 25 9 9 
9 18 10 5 28 65 9 4 
y 12 8 6 28 56 9 Я, 
9 9 8 8 51 51 6 6 

10 45 9 2 31 31 10 10 

10 30 9 5 37 37 9 9 

10 15 6 h ш 82 10 5 

10 18 9 5 49 56 8 T 

10 15 9. 6 57 57 8 8 

10 10 9 9 64 Te 9 8 

11 55 10 2 75 75 9 9 

1j 21 5 5 E Кос Hm 9 

u n 6 6 91 91 10 10 


"Shrikhand. Stat. 21:106) has shown that 
46 andr = EE ie ME (d nn. Math. Stat. 23:57) has shown that no solution 


V = 36, b = 45, r = 10, and k = 8. 


m 
o 


NO SOLUTION 
21 uf 
28 8 
22 7 
29 8 


C - Ow 


there is no solution for v = b = 


exists for 


TABLE XI-2. Incomplete blocks in a randomized complete block design 


No. of No. of No. of replicates No. of No. of No. of replicates 
treatments units per СЧ b treatments units per CN S: 5 
block balance others block balance others 

v k r r 


4 2 3 
8 2 1 
8 4 i 
9 3 k 
12 5 - 
16 2 15 
16 [4 5 Я 
16 8 5 & CX 
20 4 - 128 255 2, «e» 
25 5 6 16 - 235 
27 5 15 17 18 2, +- 1 
27 9 15 17 - 2,3 
28 4 9 18 2,3 
30 5 - 18 - 2, 5 
Б 2 31 19 20 2, ..., 19 
2 k A 19 = 42,3 
32 8 31 20 - 2,5 
е 16 p 20 - 2, 5 
56 6 - 21 S 2, 5 
c he 6 - 21 - 2,3 
hg 7 8 22 chin 
c 2 1f e 22 - 2,5 
2 3 2 511 9, >-.› 510 
64 4 21 4 Co ub 
64 8 9 8 75 Br «++, 72 
64 16 16 3 25 
6 эю 65 32 23 55 
72 8 - 64 2, we. 
81 5 ho 128 25 iv 
81 9 10 256 51 D 
81 27 ho 23 24 2, ssis 23 
с 90 9 - 25 = 2, 5 
100 10 - 2% s 2, 5 
c 10 10 - 2, - 2, 5 
121 n 12 5 356 NH, 22,7155 
125 5 31 25 26 25 083529 
125 25 31 125 156 23 away, 299, 
128 2 127 25 - 2,3 
128 4 26 E 2, 3 
128 8 26 А 2, 5 
128 16 3 364 б, aep 265 
pee 22 3 QL 5.9) 
Я =з 6s 127 27 89 2; eaat 
n = 81 Be ass 
И 12 B 243 35 — 2,...) 265 
с 156 12 A 27 ed 2, 5 
хр 5 2 28 575 
G BER 15 - 28 am 2j 
196 14 = 29 30 Bj sers 29 
c 210 M 2 29 > ARA 
А x 30 - 2,3 
15 = 50 a 3 
15 - 31 зо ЭШ УГУ. 
5 121 31 = 2, 5 
9 121 10 x 3 


Fe Аја 

таш лө eu dash (-) means that bal gns 

ЕЦ ore replicates available for some of the partially balanced designs- 
ectangular lattices for k(k + 1) treatments in blocks of k. 


ance is impossible. 
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block design; with meaningful stratification considerably more information 
may be obtained. 
The disadvantages of lattice designs are 


(i) They are more complex computationally; this is especially true when missing 
plots occur [60, 65, 68, 153, 255, 259] or when a covariance analysis is used 
[48, 74, 259]. 
(ii) Lattice designs are not available for all values of v, г, and k. 
(ii) The analysis becomes complex if the treatments are subject to different error 
variances [175, sec. 23.7]. 

(iv) The designs are more difficult to construct. 

The use of prepared arrangements [42, 112] and the use of punched card 
equipment to anhlyze the results from lattice experiments [160] greatly sim- 
plify the layout and the analysis of lattice experiments. Missing data or 
unequal error variances considerably complicate the analysis; if either sit- 
uation is likely to occur, it is suggested that the experimenter improve the 
experimental technique and (or) use a randomized complete block design. 
Also, the selection of more homogeneous material within a given experiment 
Ог of a suitable transformation may eliminate the problem of unequal vari- 
ances. If the differences in the treatment yields are large, it may be inadvis- 
able to use lattice designs, since the variances may be related to the means 
апа would, therefore, be unequal. In such cases the widely divergent groups 
Should be included in separate experiments or in a split plot design with the 
groups as whole plots [47, 227, 324; Chapter XIII]. The partial confounding 
of treatment differences with block effects makes it unwise to employ lattices 
when Comparing treatments which have a large range in yields [305]. 

Ifa design for the required number of treatments is not available, the- 
addition or subtraction of a few treatments is usually all that is required to 
obtain a lattice of the desired size. An incomplete block design with the treat- 
ments not in complete replicates may fit the experimental situation whenever 
a design with the treatments in complete replicates is not available (tables 

1-1 and XI-2). 
XI-1.2 RANDOMIZATION PROCEDURE FOR LATTICE DESIGNS WITH 
ONE RESTRICTION > 

The randomization procedure for v treatments arranged in b blocks of k 
treatments each follows that described in Chapter IX for confounded factorial 
experiments; i.e., the groups of treatments are allotted at random to the b 
Incomplete blocks, and the k treatments within each incomplete block are 
randomly allotted to the experimental units. The randomization procedure 
for other lattice designs is discussed in Chapters XII and XIII. 


XI-ls ANALYSIS FOR LATTICE DESIGNS WITH ONE RESTRICTION 


General methods of analysis for lattice designs are available for v treat- 
ments in incomplete blocks of size k with r replicates of the treatments [30. 
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175, Ch. 18, 26, 27; 177; 178; 255]. The method followed in this text is essen- 
tially that put forth by Yates [324] and Kempthorne and Federer [177, 178] 
for factorial experiments. Rao's [255] analysis is carried through for the 
simple lattice design and for the cubic lattice design. Little progress has been 
made in generalizing the analysis for experiments with n-restrictions. ү 
The analysis of variance for v treatments in b incomplete blocks of size Ё is 


Source of variation df Average value of ms 
Block (ignoring treatment) b-1 — 
Treatment (eliminating block) v—1 a + f (treatment effect) 
Intrablock (kb — v —b + 1) [^ 
ME. — 
Total kb-1 zu 


eee 


Alternatively, the analysis for the above design may be written in the form, 
= ee eee eee 


Source of variation df —— Seep: С 
Observed Average value 
"Treatment. (ignoring block) v—1 — == 
Block (eliminating treatment) b-1 Е, cpu ma 1 
Intrablock (kb — v — b + 1) E oê 
Total bk — 1 == 


SSeS SS = = 2... ee 


Designs with an analysis of variance like the above are treated in Chapter 
XIII. If the treatments are in complete blocks, the following form of the 
analysis of variance is appropriate: 


ae iM mer. 1 Re. cio Ms A 


Р Mean square 
Source of variation df 
Observed Expected value _ 
Replicate r—-1 — E 
Treatment (ignoring block) v—1 
Block (eliminating treatment) b-r 


E с + ш) kop? 
Intrablock -)V-1)-@b-yn Е oe 
| a o 


be Te ү above analysis of variance are discussed in the pre 
XIL 1 designs with two or more restrictions are discussed in Chapt 
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The following formula [177] represents the average variance of the dif- 
ference between two adjusted treatment means in an experiment of v = k” 
treatments in blocks of k treatments in г complete blocks: 

2(k — 1) { т n. m 
alk — 1) e Е ье аеро 
0—1 wF e- Dw wr e w ur" Ese) 
where n; = the number of effects confounded in (г — 1) replicates, 
noz © “ « “ & « (r = 2) « 


n= © G Шы ХЫ 3 * по replicates, 
w = 1/E,, and w = (r — 1)/(rE — E). 
The average standard error of a mean difference is obtained by taking the 
Square root of formula (XI-2), and the average effective error variance (aver- 
age error variance per experimental unit) is equal to r/2 times equation (XI-2). 
In most situations the average standard error of a mean difference is 
Suitable for all comparisons even if the comparisons are of different accu- 
Tacies, since the various standard errors differ little from each other or from 
the average error. However, the various standard errors should be computed 
order to check this point for experiments laid out in a p.b.i.b. design. 
In this connection it should be noted that there is only one standard error 
™ à balanced incomplete block design (b.i.b.). И 
In certain small experiments the degrees of freedom associated with E; 
are less than 10 to 14, and interblock information is usually ignored. Only 
mtrablock information is recovered because Es is so inaccurately determined. 
hus, in designing lattice experiments, particular attention should be paid to 
the number of degrees of freedom associated with E» as well as Æ.. Given a 
number of treatments, v, to be compared, the degrees of freedom associated 
With E, may be increased by increasing the number of replicates or by de- 
creasing the number of treatments per incomplete block. However, the de- 
Crease in the number of treatments per block decreases the number of degrees 
of freedom associated with E,, and a complete block design may have to be 
“sed in order to have sufficient degrees of freedom in the error mean square. 
he block (eliminating treatment effect) mean square, Ey, may be smaller 
than the intrablock error mean square, E.. Whenever this is the case, it is 
Suggested that the experiment be analyzed as a complete block design (as a 
Completely randomized design if the incomplete blocks do not form compact 
тер icates) with no adjustments on the treatment means. 
Bro, SIZE AND SHAPE OF INCOMPLETE BLOCK AND LAYOUT OF 
The optimum size and shape for an incomplete block depends Е the 
nature of the experimental area and material. The optimum shape of an in- 
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complete block may be determined along the lines set forth in Chapter ш. 
The shape is adjusted to obtain experimental units as alike as possible within 
each incomplete block. The area of the incomplete block is determined by the 
size of the experimental units, the relative variability, and the number of 
units per incomplete block. The desirable number of experimental units per 
incomplete block depends upon the heterogeneity among units and the degree 
to which this may be reduced by decreasing the size of the incomplete block. 
The effect of size and shape of the incomplete block on the experimental error 
has been investigated empirically [168; Chapter III]. 

To date there is little experimental evidence concerning the relationship 
between the number of experimental units per incomplete block and the av- 
erage effective error variance. From the results obtained by Cochran [50] 
and Cox [73], there is an indication that the average effective error tends to 
increase as the number of units, k, per incomplete block increases. These 
authors do not give the actual error mean squares but do give the efficiencies 
relative to the randomized complete block design. The efficiencies tend to in- 
crease as k increases. If the error mean Squares were available, it would be 
possible to check the extent of the relationship between k and the average 
effective error variance. If the increase in the error per unit increase in k 
is relatively large, then it will be necessary to use smaller incomplete blocks 
and to pay more attention to the size and shape of incomplete blocks for 
relatively large values of v. F 

The one-restrictional lattice design is suitable for many types of material 
and for many situations. For example, each incomplete block may constitute 
a day, a judge, or a litter; in a field trial the incomplete blocks may be laid 
out end to end, etc. This feature represents an advantage of the lattice design 
over the randomized complete block design, since the extra variation en- 
countered is controlled in the incomplete block design but is not in the ran- 
domized complete block design. In field experiments the calculation of 
efficiency of the incomplete block design with incomplete blocks laid end to 
end, relative to the randomized complete block design, may not be informative, 


since one would not ordinarily use a complete block design with such an ar- 
rangement, 


XI-2 Classification of Lattice Designs 


In general, lattice desi 
I comprising the lattice d 
the lattice designs fo; 
called “the incom 
[60, 175]; this d 
incomplete block 
group of incomp], 
a factorial arran. 


gns may be classified into two broad groups: group 
esigns not forming compact replicates and group I 
rming complete blocks or replicates. Group I is sometimes 
nplete block designs" and group II “the lattice designs 
istinction does not appear logical, since both groups for? 
designs. The term “lattice” is used chiefly to designate ш 
ete block designs in which the y treatments do not represent 
gement, It should be remembered that all designs discuss¢ 
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in Chapters IX to XIII are incomplete block designs; the term lattice design 
is restricted to the group of incomplete block designs discussed in Chapters 
XI to XIII. However, the distinction is not always clear cut. 

The lattices of group I may be classified into the general categories: 


(i) balanced with r = kb/v = A(v — 1)/(k — 1), 
(ii) partially balanced incomplete blocks (p.b.i.b.), and 
(їй) others including one or more restrictions on the allocation of treatments. 


The classification of lattice designs forming complete replicates, group 
H, is given below. 
Size of No. of 
Design incomplete block arrangements 
Two-dimensional with y = k? entries (square lattices) 
One-restrictional 


Simple or double lattice E 2 
Triple lattice k* i 
Quadruple lattice КООН A 
Balanced lattice ке v 
"T'wo-restrictional 2 
Semi-balanced lattice square kt eM: 
Balanced lattice square x is 2 4 
Unbalanced lattice square kt : 
P К k = 6,10 P 
à x k = 12 зл 
Two-dimensional with у = pk entries (rectangular lattices) 
ne-restrictional (preferable for k — p — 1) X 2 
Simple rectangular lattice Е 3 
riple “ “ k* 
Near balance rectangular lattice kee К 
Two-restrictional 27 $ 
Three-dimensional with v = k? entries (cubic lattices) 
ne-restrictional—k? blocks of k А * 3 
riple lattice (commonly known as cubic) kt 1 
uadruple lattice E 5 
Quintuple lattice E у 
на к** kki 
Two-restricti 
k RR Propa] k** 3* 
k rows X k? columns p V 
hree-restrictional * 3t 
k X k lattice square with split plots of k К, 3+ 


k k Xk lattice squares © 


21 
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Three-dimensional with v = kpq entries 
One-restrictional cu = 
Two-restrictional — — 
Three-restrictional = 
Four-dimensional with у = kt entries (quartic lattices) 
One-restrictional 


k? blocks of k k* 4+ 
k? u шү: k* 2* 
К, «фы , k* 2+ 
"T wo-restrictional 
k? blocks of k of k ку re 
k « “kuk kee rs 
k © «рр ; k** — 
k rows X k? columns kt 5 
"Three-restrictional 
k? k X k lattice squares k** E 
k X k lattice square with split plots of k? k** E 
k blocks of k of k of k k** = 


Four-restrictional 


k X k lattice squares with split plots of k Xk lat- 
tice squares k* — 


n-dimensional with k^ entries 
One-restrictional 


n-restrictional 
n-dimensional with Крд «++ t entries 
One-restrictional 


n-restrictional 


where k* = any integer, k** 


b = any prime number or power of a prime num- 
er, and + 


means that more arrangements are available for some values of k- 

The number of arrangements, say n, for a given design represents the num- 
ber of replicates, r, available for the design. Also, any multiple, say q, of the 
number of arrangements may be used as the number of replicates in a give? 
experiment; thus, г = ng. Although the basic set of arrangements may be 
Tepeated q times to obtain the desired number of replicates, this is not recom- 
mended. Instead, n should be as nearly equal to r as possible, because this tends 
io equalize the precision of the various comparisons between means. ЁОГ 
сар le, instead of using two sets, q = 2, of a triple lattice design for twenty- 

AS AE Wohn replicates, one should use a balanced lattice design- 4 
given to date Dd for group II lattices is more detailed than we. 
design containing us a but is still not entirely complete. For d id 
the different replicat me groups of varieties in the incomplete bloc ae 

es [41, 227, 255; Chapter XIII] is not included in 
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classification. However, the above breakdown does serve to illustrate the 
diversity of types of lattice designs available to the experimenter. 

The term dimensional is used to refer to the power of the number Ё in the 
k” lattice design. This term was adopted by Yates [326] in his discussion of 
the three-dimensional lattice known as the cubic lattice in which all the 
combinations of levels of three factors were likened to the points of the 
intersections of the three-dimensional lattice (a geometrical configuration; 
see figure XI-1). Also, combinations of two factors may be likened to the 


m 


jw 


pss 
Figure XI-1. Geometrical configuration for cubic 
lattice designs. 


Intersections of the lines on ordinary graph paper. Ап n-dimensional lattice 
may be thought of as all intersections of the lines in an n-dimensional lattice. 

. The simplest lattice designs, some designs in group I and the two-dimen- 
Slonal one-restrictional lattices of group II, are subject to a single restriction 
Оп the random allocation of varieties; i.e., certain groups of varieties appear 
together in an incomplete block. Lattice square designs are subject to two 
Testrictions on the random allocation of varieties within the complete block; 
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the grouping of varieties is by rows and columns (or some similar grouping) 
within each replicate. The three- and four-restrictional designs are subject 
to three and four restrictions on the random allocation of the varieties. 

Within each of the n-dimensional s-restrictional designs there are bal- 
anced, partially balanced, and unbalanced lattice designs. Balanced lattice 
designs have the feature that every variety (treatment) is compared with 
every other variety an equal number of times in the incomplete blocks (or 
rows and columns). Every pair of varieties has the same value of à. In order 
for balance to be attained and for every variety to be compared with every 
other variety once and only once in the incomplete blocks of size k, it is 
necessary to have k + 1 replicates for k? varieties, k? + k + 1 replicates for 
k? varieties, k* TE -TRII replicates for k* varieties, etc., and k must be a 
prime number or power of a prime number. In designs that are not balanced, 
the various pairs of varieties have unequal values of ^. 

The term rectangular laltice has been used by Harshbarger [148-50] when 
there are k X p combinations of two factors rather than Ё?. Also, he proposed 
this term to contrast with the notation used by Yates [323] and Goulden 
[136], who designate a lattice design for k? varieties as a square lattice. The 
notation used herein is not inconsistent with that used in the literature, even 
though it is in a slightly different form. 

The two-dimensional one-restrictional partially balanced lattice design 
in two replicates has been known as the simple lattice or the lattice. In order 
to be consistent with the notation for the triple lattice [48, 74] and the 
quadruple lattice [136], this design should be called a double lattice. For the 
three-dimensional designs the notation triple, quadruple, etc. is used to be 
consistent with the notation for two-dimensional designs. The prevalent use 
of the name cubic lattice is for the lattice design with & varieties in sets of 
three replicates, which in the present nomenclature is the three-dimensional 
triple lattice; the name cubic lattice is retained for this design. 


XI-3 Two-Dimensional One-Restrictional 

Lattice Designs 
. In constructing lattice designs for k? treatments, the entries are designated 
in the manner described for factorial arrangements, For example, k? = 3° 
treatments are numbered 00, 01, 02, 10, 11, 12, 20, 21 


ting arrangement both as a pseudo-factorial and a 
Beni бырге appears preferable both dee aE 
е quasi-factorial describes the arrangement of v varii 
Bu. кеч terms pseudo. main effect and pseudo-interaction are considered preferable 
action because the latter terms are inappro- 
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effects and pseudo-interactions to distinguish them from the factorial experi- . 
ment. In the above example, then, the following pseudo-effects are available: 
А, B, AB, and AR, with the three levels for each effect. In constructing 
lattice designs for k? treatments, various effects are confounded with incom- 
plete block differences in the different replicates with the confounding spread 
as equally as possible among the pseudo-effects. The analogy between con- 
founding in factorial experiments and lattice designs is illustrated in the 
examples that follow. 


XI-3.1 DOUBLE (SIMPLE) LATTICE 

Suppose that it is desired to set up a double lattice design for k? = 3° 
varieties or treatments. The nine treatments are numbered 00, 01, 02, 10, 11, 
12, 20, 21, and 22. The levels of pseudo-effects are obtained by summing the 
Yields for certain treatments. The relationship between the treatments and 
levels of pseudo-effects are 


Level of pseudo-effect "Treatments 
(А), 00 + 01 + 02 
(А), 10 +11 +12 
(А), 20 + 21 + 22 
(B)o 00 + 10 + 20 
(B) 01 4-11 + 21 
(В), 02 +12 4- 22 

(АВ), 00 +12 + 21 
(АВ), 01 +10 + 22 
(АВ), 02 + 11 + 20 
(АВ?) 00 4- 11 4- 22 
(AB?) 02 + 10 + 21 
(AB?)» 01 +12 + 20 


Since the pseudo-main effects are no more important than the pseudo-inter- 
actions, it matters little which are confounded with incomplete block dif- 
ferences. For simplicity, let the A pseudo-effect be confounded with incomplete 
block differences in one replicate and let the B pseudo-effect be confounded 
In the second replicate. The resulting X and Y arrangements (this notation 
55 followed because of its prevalent use in statistical literature) before ran- 
domization are 


Level of Level of 
Arrangement X effect Arrangement Y effect 
(A)o (B)o 
(An (В), 


(А) (B): 
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'The AB and AP? effects are unconfounded with incomplete block differences 
in both arrangements, and the A and the B effects are completely confounded 
in the X and in the Y arrangements, respectively. Some information is 
available on all treatment comparisons even though the comparisons among 
treatments appearing together in an incomplete block are more accurate than 
the comparisons of treatments not appearing together in an incomplete block. 

Thus, there are n = 2 basic arrangements in the double lattice design; 
the two arrangements are called the basic set. For r replicates r/2 — q of the 
replicates will have the X arrangement and the remaining q replicates will 
have the Y arrangement (see section IX-4.2). The general form of the analysis 
of variance for q sets of a double lattice experiment is 

Mean square 


76 ————o0 
Source of variation df Observed Expected value 
Replicate 2q-1 — m 
Treatment (ignoring block) k*—1 = ad 
Block (eliminating treatment) 2q(k — 1) Ep с? + = 1 Код? 
Component (а) 2(q — 1)(k — 1) — 
Levels of A X replicate (q — 1)(k — 1) E c, + Кез? 
Levels of B X replicate (q—-1)(k—1) — 
Component (b) — confounded vs 
unconfounded levels 2(k — 1) oe + kog?/2 
Intrablock 2qk* — k? — 2gk +1 E oe 
Total 2qk? — 1 — = 


where gẹ is the population variance associated with the random error devia- 
tions in the double lattice and c;? is the population variance associated with 
the incomplete block deviations (see section XI-8.2). The expected values are 
useful in understanding the combination of Е, and E, used to obtain the esti- 
mated amount of information on interblock comparisons. The efficiency 
factor for the double lattice design is k/(k + 1) (formula (Х111-3)). This result 
may be obtained by the method given in Chapter IX. Two effects, A and B, 
are each confounded in one-half of the replicates. The amount of intrablock 
information on these two effects is ¥, and is one on the remaining k — 1 
effects, Therefore, the average information per effect is [2(14) + (k — 1)(1)] 
/(k +1) = k/(k + 1), which is the efficiency of this incomplete block design 
relative to the randomized complete block design when the errors in the two 
designs are equal. 
The method of randomization is as follows: 
G) Assign the numbers ij = 00, 01, *-- 
random. 
(ii) Assign the levels of 
blocks at random. 
(iii) Assign the treatments within 
units at random. 


; k — 1, k — 1 to the treatments at 
the effects or groups of treatments to the incomplete 


each incomplete block to the experimental 


$ XI-3.1] Double (Simple) Lattice 321 


Example XI-1. Double lattice in two replicates 
(i) Usual method of analysis 


After randomization, a plan such as that presented in table XI-3 might be obtained. 
The yields are synthetic and are chosen for ease of analysis. The arrangements are 
allotted to the replicates at random, the treatments making up the levels of the A 
pseudo-effect in the X arrangement are assigned to the incomplete blocks at random, 
and the treatments within an incomplete block group are assigned to the plots at 
random. The same procedure is followed to obtain the experimental randomization 
for the Y arrangement. The incomplete block totals in replicate I are designated as 
(В), (В), and (В), corresponding to the (B)o, (B), and (В): pseudo-effects, re- 
spectively, in replicate I; the sum of the yields of treatments 02, 12, and 22 in repli- 
cate I is the (В)» level of pseudo-effect В for ‘replicate I: 


34+2+6=11= (Biz 


TABLE ХІ-3. Yields per plot for one set of a double lattice experiment with k? = 3? 
synthetic treatments in two replicates (Entries or varieties in parentheses) 


Replicate I 
(Y arrangement) 


(оо) (го) (20) (c) (22) (22) (3) Q1) (01) 
8 5 3 3 2 6 3 7 3 
Block total (B) = 16 (в) =n (В), = 15 
Replicate total : ho 


Replicate II 


(X arrangement) 
2 2 T 


(œ) (00) 
4 6 
Block total (А) ell (А), = 9 (loo = 12 
32 


Replicate total 


TABLE XI-4. Total yields and other totals required for the analysis of the double 
lattice experiment presented in table XI-3 


(4) , 7 2l 


(00) 
(10) 6 | (11) 
(20) т. | (22) 
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In table XI-4 the treatment totals from replicates I and II are arranged in such a 
way that the levels of the А effect may be obtained by summing over the rows and 
the levels of the B effect by summing over the columns. For example, 


(A).o = (8+ 6) + (3 +2) + (3 +4) = 4+5 +7 = 26 
and (B) = (3 + 2) + (7 + 3) + (3 +2) = 5 +10 +5 = 20. 


The next step in the analysis is to copy the incomplete block totals, i.e., (А): and 
(B);;, from table XT-3 in the appropriate places. (А), is placed next to (A).o, (A)a to 
(A).1, etc. This means that the sum of the group of varieties, which are together in the 
incomplete block, is placed next to the sum of the totals of the same treatments. The 
next step is to compute the quantities (А).; — 2(4)»; and (B).; — 2(B)ı; which corre- 
spond directly to the rkc- and rkc, values found in various references [48, 74, 160]. 
The first (А).; — 2(А)»; value in table XI-4 is 26 — 2(12) = 2. The computations 
necessary to obtain the last column and row of table XI-4 are explained later. 

The sums of squares for the randomized complete block analysis (top part of table 
ХІ-5) should be computed first, since all the sums of squares are required for the 
analysis of a double lattice experiment. The latter analysis consists of partitioning 
the randomized complete block error (or residual) sum of squares into two portions: 


(i) that due to the variation among incomplete blocks after the treatment effect 
has been removed — block (eliminating treatment effect) 

(ii) and that due to residual variation after removing treatment, complete block 
or replicate, and incomplete block effects — intrablock. 


The block (eliminating treatment effect) sum of squares is computed from the 
quantities, (A).; — 2(A)»; and (B).; — 2(B)1;; thus: 


k-1 
[(A).i = (А) _ (2E). — Ado}? , (B); — (В? 
E k(1 + 1) -il 2j (xt «Et Ee + m l 


_ ООВ). = (Ву? 
pm. 


(X1-3) 


2(3) 18 


„Ё+# + 8 (-S.(-90es (—8)? 
2(3) NES 
= 0.111 + 8.111 = 8.222. 


f squares of coefficients is 12 +(-1%=2 з 
uares is obtained by subtracting the block (elimi- 
from the randomized complete block error sum О 


m of sq 
Squares 
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squares, 13.44 — 8.22 = 5.22 with (2 — 1)(&? — 1) — 2(k — 1) = (k — 1)? = 4 degrees 


of freedom. The sums of squares and mean squares are summarized in table XI-5. 


TABLE XI-5. Analysis of variance for the data of table XI-3 


Source of variation 


Replicate 
Treatment (ignoring block) 
Error (r. c. b.) 

Block (elin. treatment) 


Among (А) 4 = 2(А)„, 
Among (в); - 2(в),, 


Intrablock = residual 


Correction for mean 


ra 
The average variance of a mean difference between RE ааа ДО 
lattice has been given in general terms by Kempthorne an 1 еа pe E 
(Х1-2)). Briefly, the average variance of a mean difference n. AE А 
manner. Тһе variances of а mean level of an effect іп a pep Ps ru 
founded with incomplete block differences, e. (A)u/k, an San ms 
effectina replicate where it is confounded with incomplete b ad. ат MS. RP 
are o/k and (62 -+ ko a°)/k, respectively, where E, isan мш os RS Dd LAE 
an estimate of ¢2 + kag? in a double lattice with two RE та үг 
mental unit, the estimated variances for intrablock ап et ja yak 
are E, and 2h, — Ea respectively, and the amount of VERS а 
Comparisons is ш = 1/0, = 171.305 = 0.7663, and on in РВ. 
w = l/(2E, — Ej) = 1/[2(2.055) — 1.305] = 0.3565. Тһе averag 


difference between two adjusted means is equal to 


ктт Е ex) 
- 1.305} Т коюл = 1544, 


w— w _ E» — Ee _ 9.122, The corresponding standard error is 1.24. 


k(w +w) А i | s 
i ance. Since a 

For two replicates, formula (XI-4) is the ата one A Ac AT 

Mies. amente dg оша difference between 

Subject to different, variances than others. The standard d ч 3 ir 

two adjusted means for a pair occurring together in an incomp 


PI ELS Ut Е ЕРЕ О (X1-5) 
iu 2w 


Where u= 
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The standard error of a difference between adjusted means for a pair of treatments 
not occurring together in an incomplete block is 


NAM i E 22 f = VET 2а) = 127. (ХІ-6) 


The efficiency of this double lattice relative to the randomized complete block 
design is the ratio of the two effective error variances in per cent: 


1.680 
1544 7 109 per cent, 


or a gain in efficiency of 9 per cent. If the gain in efficiency is small (less than 10 to 
15 per cent), the resulting standard error for a randomized complete block and the 
unadjusted means are used. If the gain in efficiency is larger than 15 to 20 per cent, 
the double lattice analysis should be used and the treatment or variety means should 
be adjusted. In the present example the unadjusted means differ little from the ad- 
justed means, owing to the small gain in efficiency, but the means are adjusted for 
illustrative purposes. 

The first step in obtaining adjusted means is to multiply the (A).; — 2(A)o; and 


(B).; — 2(B)y values by p = ЕУ = 0.122. The resulting values are entered in 


the last column and last row, respectively, of table XI-4. For example, the second 
cz’ value is и [(A).1 — 2(A)a] = 0.122(3) = 0.37, and the last c,” value is u[(B).2 
— 2(B)i2] = 0.122(3) = 0.37. The second step in obtaining the adjusted means is to 
add the corrections c;' and с, to the corresponding treatment total and divide the 
resulting sum by two, the number of replicates. The adjusted mean for treatment 01 
is [5 + 0.24 — 0.73]/2 = 2.26 and for treatment 22 is [13 + 0.37 + 0.37]/2 = 6.87. 
The remainder of the adjusted means (table XI-6) are obtained in a similar manner. 


TABLE XI-6. Adjusted totals and means for the experiment in 
table XI-3 


Treatment Unadjusted 
number totals 


Sum of Adjusted Adjusted 
adjustments totals 


means 


(ii) Analysis of double lattice as a confounded factorial 


In order to understand the various adj iven in @) 
^ ? Justments and sums of squares given in (1); 
it is helpful to obtain these quantities in the manner described in Chapter IX. For W^ 
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method of analysis, it is necessary to obtain the levels of the pseudo-effects in each 
replicate (table XI-7). These effects are obtained by adding the necessary treatment 
yields for each pseudo-effect. (АВ), in replicate I is equal to 


(AB)u = Ао + Хш + Xin = 3 + 3 + 6 = 12. 


Тһе block (eliminating treatment effect) sum of squares is the sum of squares of the 
differences of the levels of the effects confounded in one replicate and the correspond- 
ing level of the effect in the replicate in which it is unconfounded; thus: 


(14 — 12)? + (12 — 9? + (14 — 11)° (40 — 32)? 


3a +) 9(1 + 1) 
‚‚ QL — 16): + (7 — 13)? + (14 — 11)? _ (32 = 40)? _ 
à 30 + 1) SSA wees 


The intrablock error sum of squares may be obtained as the interaction of levels of 
the effects with the replicates in which the effects are unconfounded. The only effects 
unconfounded in more than one replicate are AB and AB*. The interaction of levels 
of these two effects with replicates yields 2 + 2 = 4 degrees of freedom and a sum 


of squares of 
2 


УУ ABYou? _ (Xi. Ха) _ Lam. Bonae zz (АВ?) put 


9-1uc0 k k 2k? k 
(Х...2 + Х..2) X(AB) 2 — X. 
ee L a а (XI-7) 
[2 2k ^ 2g 
= 1374112 4 19 роз 15292 32-40 28-2020 , 727 
3=k 9-E 2k = 6 28 = 18 
42U- 161 + 924 92 4 107+ 7? 3222-400 37° + 18° + IT , _ 4022 
3 = Е поа 2k = 6 2k? = 18 


= 03 – 1): + (12 — 12)? + (15 — 9° + (21 — 16)? + (9 — 9)? + (10 — OE 
k=6 


2(40 — 39)2 2 
e чеш = 5.22, 

Тһе intrablock error sum of squares need not be obtained by subtraction but may be 
Computed as above. The analysis of variance is the same as that, presented in table 

I-5. The weights ш and ш” and the standard errors are computed as before. 

Before computing the adjusted means, it is necessary to obtain the weighted levels 
of the pseudo-effects. This weighting is necessary because the partially confounded 
effects are estimated with different variances in the two replicates. In the replicates 
where the effect is confounded with incomplete block differences, the levels of the 
effect are estimated with a variance equal to 6^ + kég? = 1/w'. The unconfounded 
effects are estimated with a variance equal to 2: = 1/w. The weighted levels of the 
effects are obtained by weighting the level of an effect inversely to the variance with 
which it is estimated. The weighted level of the (A)o effect is 


аА) + w(A)io _ 0:3565(12) + 0.7663(14) _ үз 3650. 
а + = © = 9.3565 + 0.1663 13.3650 
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"The remaining levels of effects are computed similarly and are given in the last column 
of table XI-7. 


TABLE XI-7. Weighted and unweighted effects on a total mean per k — 3 plot basis 
из са Replicate I? Replicate II* inue for or qeu 

15.5650 

11.0475 

13.0475 

12.5875 

8.9051 

13.0475 

12.0000 

12.0 12.0000 


12.0 12.0000 


18.5 18.5000 


9.0 9.0000 
8.5 8.5000 


Total 160 = (3 + 1)ю 128 = (5 + 1)32 144.0 а Gune 


144.0001 


"Underline indicates that level of the effect confo 


ences in the specified replicate. 
* 


unded with incomplete block differ- 


The adjusted means are obtained by using the weighted effects in the last column 
of table XI-7; thus: 


zt — Oi (В) q OB + (АВ; _ (X...) (X1-8) 
ig k k 2k * 
which for treatment 00 is 


13.3650 + 12.5875 + 12.0000 + 18.5000 72 
3 а 6.82, 


and for treatment 02 is 


13.3650 + 13.0475 + 12.0000 + 9.0000 72 
3 6 


В іп Ше 
puted similarly and agree with those in t 
ng errors. 


3.80. 


The remaining adjusted means are com 
last column of table XI-6 within roundi 


(iii) Application of Rao's general analysis to a simple (double) lattice 
In 1947, C. R. Rao 


[255] described a general method of analysis for incomplete 
block designs [30, 107, 


221, 226]. The incomplete block designs are of the type that 
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have v varieties or treatments in b incomplete blocks of k treatments with each treat- 
ment repeated r times. In the double lattice design, v = Ё, г = 2, and b = 2k. 

The double lattice design represents two different groupings of the k? treatments 
in the following manner: 


Y grouping of treatments 


X grouping 
of treatments 


Thus, in the X grouping, treatment 00 appears with the k — 1 treatments (01,02, ---, 
0,8 — 1) inan incomplete block, while in the Y grouping 00 appears with treatments 
10, 20, ..., k — 1,0. Every one of the k? treatments occurs with 2(k — 1) treatments 
к ап incomplete block in the two groupings. Likewise, there are (k — 1)* treatments 
With which a given variety is not associated in an incomplete block. On the basis of 

association" of treatments in the incomplete block, Rao sets up a system of associates. 
First associates (or logically zeroth associates) are treatments which do not appear 
together in an incomplete block. Rao's second associates refer to pairs of treatments 
appearing together once in an incomplete block. 

The terminology zeroth, first, second, etc. associates is adopted herein and is 
equivalent to Rao's first, second, third, etc. associates. The reason for this change is 
to have the degree of the associate refer to the number of times, А, that treatment 
Pairs appear together in the incomplete blocks. 

In the double lattice there are no = (k — 1)? zeroth associates and лу = 2(k — 1) 
first associates for any treatment. Ao = 0 refers to a pair of treatments which are not 
Compared in any incomplete block. А, = 1 refers to a pair of treatments which appear 
together in one of the incomplete blocks. If a pair of treatments appears together twice 
in the b incomplete blocks, А equals 2. However, the association of treatments in the 
Incomplete blocks for the double lattice design is either none or once; therefore, the 
two parameters are ào = 0 and À; = 1. "ws 

One further type of parameter peculiar to a given incomplete block design is the 
number of associates in common among the various pairs of treatments with varying 

“Brees of association. For a pair of treatments which are zeroth associates there are 
Our parameters, po) = (k — 2)? = number of treatments in common among their 
Zeroth associates, po? = pio” = 2(k — 2) = number of treatments in common among 
eir zeroth and first associates, and pi? = 2 = number of treatments in common 
among the first. associates of the two treatments in question. à 
. Likewise, there are four parameters for a pair of treatments which are first asso- 
Slates. Thus, poo! = (k — 1)(k — 2) = number of treatments in common between 
roth associates of the two in question, ро! = pio! = k — 1 = number of treatments 
“common between zeroth and first associates of the two treatments, and р! = k—2 
= number of treatments in common among their first associates. The parameters for 
the simple lattice are summarized in table XI-8 along with Rao’s general notation. 


TABLE XI-8. Parameters for а simple lattice design with the corresponding notation from Rao's paper 


Present terminology Rao's general Present terminology Rao's general Present terminology 
terminology terminology 


Rao's general 
terminology 


k? а number of varieties 


pa 
Е " replicates 2(k - 2) 

" blocks nl (k - 1)(k -2) 
Piy (k - 2) 


" per block 


(k - 2)? 2(k - 2) 


2 
(к = 2) 


(x = 2) 
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With the parameters in table XI-8 for the simple lattice design the following con- 
stants are computed (the letters A and B in this section do not refer to pseudo-effects; 
they are used to retain Rao's symbolism): 


Ao = 2k — 1; 

Au = po! = k— l; А = AnBu — AnBo = 2k*. 
Bu = 1; | (Х1-9) 
Bu = 2k — 1 + (po? — рої) = k + 1. 

Аю = 2(k — 1); check 
Аш = —pa? = — 2(k — 2); 1 
Bo = —1; A = АоВо — AioBoo = 2k. 
By = 2(k — 1) — (pu! — pn). (XI-10) 


Рини 2, (хл) 
а: (X1-12) 


eed = (2k — lw + w. 

її = Дро = (k — 1)(w — w).- 

Bo! = w any ( X A! = Ag'By' — Au'Bo! = 2k*w(w + ш). 

By = Ao + Bo (po? 5 po!) (X1-13) 
= (k + 1); + (k — 1)ш. : 


Aw = 2(k — 1уш + 20. check 
Аи = iM — w')pa? | 
= —2(k — 2)(w — 
By! = —(w-— b A A! = Ax Bw! — Аю'Во' = 2ktw(w + w). 
Bu! = Avy’ + By (py! — py) (xT) 
= (k + 2)w + (k — 2)ш. 
The variance of a mean difference for a pair of treatments which are zeroth asso- 
ciates ig 
2kBw' _ 2 = (XI-15) 
A 7, PES Tw MEE Zw 
and for a pair of treatments which are E ET is 
281 ыр ET EM cR = v (XI-16) 
~klw tw’ 


The os variance 3 a mean difference is 


FG — 2) 
2 


k? = Tp 
Y 


-— 


2 k—1 (XI-17) 
k+ b 2w 


Since there HA Pa — 1)? comparisons among zeroth associates and 2(k — nF >) 


Comparisons among first associates. 
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For Rao’s general method of analysis the parameters for the data in table XI-3 
are 


= 2; =9;b=6;k = 3; 
deci M | (XI-18) 


№ = 0; no = 4; № = 1; n = 4. 


2 2 
ру = ! ) pj = ( ) (Х1-19) 
2 2 2 1 


» 1 E 
From table XI-5, w = r = 0.7663 and ш = EET 0.3565. The constants 


e 
for the simple lattice are now computed as 


Аа = 5, An = 2, Ba = 1, Ви = 4, and А = 18. 
Аш = 4, Аш = — 2, Boo = —1, Bo = 5, and A = 18. 
R = 2w + w = 1.8891, Ay = w — w = -4098, Ao = 0, 
Ao’ = 4.1880, An’ = .8196, Bo’ = .4098, By’ = 3.7782, 
Aw! = 3.7782, Ay! = —.8196, Boo’ = —.4098, Bio’ = 4.1880, and 
A’ = Ao Bu! — Ay’Bo! = Aco! By! — Ao1'Boo’ = 2k*w(w + w’) = 15.487230. 


Equipped with the A;; and Ву, it is now possible to complete table XI-9. The first 
column contains the treatment number, the second column contains the treatment 
totals а (table XI-6), and the third column contains the sum of the block totals con- 
taining a particular treatment; for example, treatment 01 appears in the block totals 
(В) and (A), and the £ value for treatment 01 is 13 +12 = 25. 

The fourth column is computed from columns 2 and 3. For example, the second 
¥ value is Р 

Y = ka — В = 3(5) — 25 = —10. 


The fifth column contains the (k — 1)? 
treatment a. The sixth column contains the 
its zeroth associates. The second є value is c 


treatments which are zeroth associates of 
sum of the y values for treatment а ап 
omputed as 


-10-7-5-6417= —11. 
The 7 values in the seventh column are computed from the formula 


л = бп + Bu)y — Bue _ (k+2)y—e (XI-20) 
A 2k? 
For example, the first 7 value is 


BEDAN = 25 204) = 95 _ 95000. 


With the above values the analysis of v; 
cate and treatment (ignoring block) sums 


blocks within replicate ignoring treatment) 


ariance may now be completed. The po 
of squares are computed as before. The 
sum of squares is 


16? + 11? + 132 + ll? + 92 + 122 (40? + 322) 
i... See ллу е 5.78. 


ISE 


TABLE XI-9. Computations for the simple lattice design by Rao's method 


Treatment no. 


ГА 


о Ум ғ 


2 


o 
5 
T 
5 


m 
ч 


72 = с 


28 
25 
25 
25 
22 
20 
27 
2h 
22 


216 = kG 


@ = variety or treatment total 


A 


B 


с 


р 


20.7102 
1.2495 
6.6669 


3.5484 
13.9134 
3.2985 
5.0077 


1.6595 
20.8101 


TT-0040 = kw'G 


60.5115 
33.9942 
30.8757 
33.9912 
61.2185 
31.9452 
30.2160 


55.8855 
60.2516 


285.0900 


2.99921 


= .56162 


-98585 
-06004 
1.99921 
04668 
5600 


«50019 
4, .0h668 


10.63530 


6.8175 
2.2567 
3.8042 
2.8783 
4,8175 
2.8650 


3.3783 
2.3175 
6.8650 


36.0000 = G/2 


6 = sum of block totals in which treatment © appears = sum of A for treatment Q and its zeroth associates 


1 ', - ' - Ы - -ч' 

AE CEST + Biy ) Boy (в) [(k + 2)w + (k= 2)«]A = (v - w')B 
= ka - = 
Y At 2X?u(w + м!) 

= _ i 
© = zeroth associates of treatment а DsCtx- 2 

` 

€ = sum of the Y values for treatment @ and its zeroth associates (Boi + B1) Y - Boe (k+2)y-€ 


A 2x? 
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The treatment (eliminating block) sum of squares is 
Хут = 51.44. (XL-21) 
Therefore, the block (eliminating treatment) sum of squares is 
5.78 — (49.00 — 51.44) = 8.22, 


which is the same as obtained previously (table XI-5). 
With the analysis of variance the values of w = .7663 and w' = .3565 are com- 


puted. The values of A;/', B;;', and A’ (see above) and the last four columns of table 
XI-9 may now be calculated. 


The first A value is computed from the formula A = kwa — (w — ш)в = 
3(.7663)(14) — (.7663 — .3565)28 = 20.7102. The В value, the sum of the A values 
for a treatment and its zeroth associates, for treatment 00 is calculated as 


20.7102 + 13.9734 + 3.2985 + 1.6593 + 20.8701 = 60.5115. 


The C values are obtained from the formula, 


Г + 2)w + (k — 2)w']A — (w — w)B (хт?) 
2?ш(ш + ш) 

— [5(7663) + .3565]A — .4098B _ 
15.487230 = .27041634A — .02646051B, 


the first C value being 3.99921. 


The D values are the adjusted treatment means and should correspond (within 


rounding errors) to those given in the last column of table XI-6. The adjusted mean 
for treatment 00 is 


D=C+ (s -= LO) = 3.99921 + 2.81830 = 6.8175. (X1-23) 

Several partial checks are available in the construction of table XI-9; these are: 
La = G = grand total = X.... (X1-24) 
DB = kG. (Х1-25) 
Ly = Le = Уу = zero. (XI-26) 
DA = kw'G. (XI-27) 
УВ = [(k — 1)? + УА. (X1-28) 


uC = tat uj Uh + 2)w + (k — 2)u]Y:A — (ш — ш): В) 


rg ju = 1) — wlk — 
- зс 2090-8) и (X1-29) 
Ур = 6/2. 


(X1-30) 
Example X1-2. Double lattice with four replicates (g = 2 sets). In the event that 
two or more sets of the double lattice 


a г experiment are used, only one additional sum of 
Squares is required for the analysis. The numerical example in table XI-10 contains 
four replicates or q = 2 sets of a double lattice. The first set is X; and Y;, and the 
second set is X; and Үз. The method of analysis is general for 4, 6, 8, -- -, 2g replicates- 
The double lattice experiment with 2, 4, 6, etc. replicates is partially balanced; if one 
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replicate is omitted, the design becomes unbalanced, resulting in complications in the 
analysis. In designing a double lattice experiment with four replicates, different ran- 
domizations are required (see table XI-10 for the experimental arrangement) for the 
two sets. The X and ¥ arrangements of any set are kept together, with the arrange- 
ments being allotted to the replicate at random. 

The yield data for k? = 3? treatments in two sets (four replicates) of a double 
lattice design are presented in table XI-10. The incomplete block totals, (А): = (A): 
effect in replicate II, (B); = (B); effect in replicate І, (B)s; = (B); effect in replicate 
ПІ, and (4)4; = (A); effect in replicate IV, and the replicate totals are given in table 
XI-10. Table ХІ-11 is similar to table XI-4 except that the totals are from four repli- 


TABLE XI-10. Yields from four replicates of a double lattice experiment (Entry or treat- 
ment numbers in parentheses) 


Replicate I (т, arrangement) Replicate totel 
(оо) (го) (20) | (ce) (22) (22) | (22) Q3) (02) 
8 5 3 5 2 6 T 
Block total Go = 16 (в) = 11 (в); = 15 hos X... 


Replicate II (x, arrangement) 
(21) (го) (22) | (20) (22) (22) | (01) (ce) (оо) 
2 2 T 3 5 5 2 4 6 
ist ES (25 = 9 ao = 12 2=%,, 


Replicate III (5 arrangement) 


@) ce) Q2 (02) e» 2) (20) (00) (20) 


Block = 
E [rw om зе 36 =X; 


Replicate IV «„ arrangement) 


(Ady ell 


Block total (Ayo = 25 (А) = 12 Bia Ky. 


TABLE XI- 11. Table of total yields and other totals required for the analysis of the 
double lattice experiment in table XI-10 


(5, + Aas (A) 4 
= (х), - 2(x), 


12 + 15 =27 o 
9+11=20 4 
11 + 2 2 


E numbers and totals 


(09) [m 20 
ЁШ tà - 


1549-222 
Cat) 


(в), + ( 
ay t1) 
= o 33116 + ah = 30 


OV - a(x), 
ect 
У. 
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cates instead of two. The (A).; and (B).; totals are obtained by summing the row and 
column yields, respectively. The second from the last column is the sum of the incom- 
plete block totals from replicates II and IV or the replicates in which the А effect is 
confounded with incomplete block differences. The second from the last row is obtained 
from the incomplete block totals of replicates I and TII. The next to the last column 
and row are obtained as in table XI-4, while the last column and row are computed 
after completing the analysis of variance table (table XI-12). 


TABLE XI-12. Analysis of variance for the data of table XI-10 


Source of variation 


Replicate . 1.30 


Variety (ignoring block) 12.11 


0.196 


Error (г. c. b.) 


Block (eliminating variety) 


Component (a) 
Component (b) 


Intrablock 


Total 
Correction for mean 


The component (b) sum of squares is obtained from the comparison of the level 
of the effect in the confounded replicates and in the unconfounded replicates; it is 
exactly analogous to the component (b) sum of squares with two replicates of the 
double lattice except that 2q instead of 2 appears in the divisors; thus: 


SA) = 2007 _ [EG — 200d УВ) — 200 
= 2gk 2qk? 276 
ЖПБ ГОВ) = 2)? 

2e 


1.078 = Eb 


0.656 = E, 


(ХІ-31) 


220234222 6? (5) + (5) +42 (-6)? 
2 ry CIEE CE олт. 

Е The component (a) sum of squares is the interaction of the levels of the effects 
with the replicates in which the effects are confounded. Thus, the sum of the inter- 


action sums of squares for levels of effect A with replicates II and IV and for levels of 
effect B with replicates I and III is 


12? + 15? + 97 + 117+ 1124 12? 322 382 27 --20--23 | 70 


k-3 К =9 Ж=б | = 18 
2 2 
RES +++ дот + 36% 30-2420, 718. 
= Egg 2k — 6 2g = 1 


= 034 + 3.11 = 3.45, with 2(r/2 — 1)(k — 1) = 4 degrees of freedom. 
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The intrablock error is obtained by subtracting the total of the component (a) 
and component (b) sums of squares (block (eliminating treatment) sum of squares) 
from the randomized complete block error sum of squares: 


19.11 — 3.45 — 5.17 — 10.49. 


The weights and the weighting factor for adjusting the treatment totals are 
obtained as 


к 1. ЛА ee 3 i Д 
Ш = ТБ Е, 101.018) — 0.656 — сыр iS) 
1 
w= 0656 = 1.524, (X1-33) 
and i gs Е) 0.844 _ 9.100. — (XI34) 


#= шш)" kaes +a- DE) 8.436 


The average effective error variance is 


rfi T zu] = 0.656[ 1 +оо] = 0.754. (XI-35) 


The efficiency of the double lattice relative to the randomized complete block 


design is 0.796/0.754 = 106 per cent. 
The average standard error of a mean difference between t 


2E, 2k E ХІ-36) 
Ni = ТЕ aul = .6l. ( 
n the 


wo adjusted means is 


The standard error of a mean difference for two treatments appearing i 
Same incomplete block is 


[+ j E +100) _ бо, (X137) 
- z 


; The standard error of a mean difference for two trea tments not appearing together 
10 an incomplete block is 


SE 24) [06560 + 200 _ 63. (XI-38) 
"à 2 
Хе: 


The adjustments for the totals are obtained by multiplying the (А). — 2); 
г values. These values are 


and the (B).; — 2(Y); values by и to obtain the с and су 
then added to e Dou E in the same manner аз for the previous ex- 
ample to obtain the adjusted totals (table XI-13). | i Я 
The intrablock error sum of squares may be computed directly as the interaction 
of levels of unconfounded effects with the replicates in which the effects are uncon- 
.Ounded. Effects AB and AB? are unconfounded in all four replicates. Hence, the 
teraction sum of squares for levels of effects AB and AB? with replicates yields 
“We 1)(k — 1) = 12 degrees of freedom. The effect A is unconfounded in replicates 
land Ш and B in ЧЫЛА IL апа IV. The interaction sum of squares from the levels 
of effects А and B with replicates yields 2(k — 1)(2 — 1) = 4 degrees of freedom. 
© total of the interaction sums of squares yields 16 degrees of freedom for the intra- 


ос. 
k error sum of squares. 
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TABLE XI-13. Adjusted totals and means for the double lattice experiment in table XI-10 


Unadjusted Adjusted Adjusted 
totals totals means 


28.50 
9.50 
15.40 


11.90 


19.90 
12.80 


13-70 
8.70 
25.60 


ХІ-3.2 TRIPLE LATTICE 

The triple lattice design is constructed similarly to the double lattice. 
The X and Y arrangements are constructed in the same manner, but another 
arrangement, Z, is added. The triple lattice experiment may be used with 
3, 6, 9, +++, 3q replicates. The АВ (or АВ?, ABS, ог any other) effect is con- 
founded with incomplete block differences in the Z arrangement. For the 
k? = 3? triple lattice the effects A, B, and AB? (or AB) are each confounded 
with incomplete block differences in one arrangement and unconfounded in 
the other two, while AB (or AB?) is unconfounded in all three arrangements. 
The randomization procedure for the triple lattice follows that for the double 
lattice; thus: 

(i) assign the treatment numbers to the treatments at random, 


(ii) assign the arrangements to the replicates and the level of the effects to the 
incomplete blocks at random, and 


(iii) assign the treatments within the incomplete blocks to the plots at random. 


For more than one set of the triple lattice, proceed in the same manner as 
described for the double lattice. 


The general form of the analysis of variance for a triple lattice is 


Mean square 


Source of variation df ача 
Observed Expected value_ 
Replicate За—1=г— == а 
"Treatment (ignoring block) a к – 1 | = X 
Block (eliminating treatment) 3q(k — 1) E, с? + а= kop? 


Component (a) 3( 1 
q —1)(Е®—1) = кен 
а (b) (k — 1) ж. oe? + 2kog?/3 
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^ The component (a) sum of squares represents the interaction of levels of 
5 е А and B effects with the replicates іп which these effects аге confounded. 
omponent (b) represents the contrasts of the levels of the effects in the 
replicates in which the effects are confounded with the same effects in the 
remaining replicates. 
i Example XI-3. The field arrangement and yields (synthetic) for a k? = 3? triple 
dire with q = 1 set of three replicates are given in table XI-14. The incomplete 
lock totals E in replicates I, II, and III аге (B)i; = (B); effect in replicate I, (АВ?) 
= (AB?), effect in replicate II, and (А)з: = (А); effect in replicate III. 
TABLE XI-14. Yields and their arrangements for a triple lattice design with 3? treat- 
ments in three replicates (Treatment numbers in parentheses) 
Replicate I (Y arrangement) Replicate total 


(00) (20) (20) | (œ) (22) (22) | (21) (22) (02) 
8 5 5 sr i2 6 S PT 5 
Block total а), = 16 (в) = 1 (в) у = 15 ho =X), 


Replicate II (Z arrangement) 


(01) (го) (12) | (22) (г) (00) (20) е) (ce) 
Block total (Ава), = 9 НЕ 20 TE 9 38 = x 


Replicate III (X arrangement) 


(2 (20) (e2) (10) (11) te) (01) (2) (09) 
2X. 


Block total ise 231 (А), = (A)50 = 


TABLE XI- 15. Total yields and other totals required for the analysis of the triple 
lattice experiment in table XI-14 


es 


i 
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The next step in the analysis is to obtain the treatment totals and arrange them 
in the X, Y, and Z groupings. Summing over the rows in the first part of table XI-15 
results in the (A).; levels of effect A. The column totals are the levels of the B effect. 
The last part of table XI-15 is required to obtain the totals for the levels of the AB? 
effect, i.e., the row totals yield the k = 3 levels of effect AB?. 

The (А). — 3(A)ss (B).; — 3(B)1;, and (AB?)., — 3(AB?);, quantities represent 
the comparisons of the levels of the confounded effect from one replicate with the 
unconfounded effect from the other two replicates. The sum of the sums of squares 
of these quantities is equal to the component (b) sum of squares: 


УА): = 3(A)s? — (22[(A).: — 3(4)]]? a DXU(B).5 — 3(Byj* 
qk(l-F 1-4 4) баё? 6gk 


— DIG), = 3j? | X[(AB)., — 3(4B92]? _ (AB). — 3(AB? aol}? 
бда т: ба 
шз Mt, (т) (-9) + 6 _(—10# (ai (ан 
36) 54 18 Ens 18 
- c = 7.67. (X1-39) 


The component (a) sum of squares is the total of the interaction sums of squares 
for levels of the effect and the replicates in which the effect is confounded. For a single 
set, q — 1, the component (a) sum of squares does not exist. For two or more sets the 
component (a) sum of squares may be computed similarly to that for the double 
lattice in example XI-2. The sum of the component (a) and (b) sums of squares yields 
the block (eliminating treatment effect) sum of squares: 

0.00 + 7.67 = 7.67 with 3q(k — 1) = r(k — 1) = 3(3 — 1) = 6 degrees of freedom. 

The intrablock error sum of squares is obtained by subtracting the block (elimi- 


nating treatment) sum of squares from the randomized complete block error sum of 
Squares: 


14.82 — 7.67 = 7.15, 


with (r — 1) (k? — 1) — r(k — 1) = rk? — k? — rk + 1 = 10 degrees of freedom. The 
sums of squares and mean squares are presented in table XI-16. 


TABLE XI-16. Analysis of variance 


Source of variation 


Replicate 


Treatment (ignoring block) 


Error (r. c. b.) 
Block (eliminating treatment) 


Component (a) 
Component (v) 
Intrablock 


Correction for mean 
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The weights, w and ш”, for the triple lattice design are obtained from the intra- 
block error and block (eliminating treatment) variances: 


w = 1/Е, = 1/0.715 = 1.399, (Х1-40) 


w = (r — D)/(rE, — E.) = 2/[3(1.278) — 0.715] = 0.641. (XI-41) 


The weighting factor, и, is obtained as follows: 


2 (Qw—w) _  qgE-E] [1278 —0715] _ é 
^7 бшу Мут ЮЕ Simp. so О 


The product of и and each of the quantities (A).; — 3(A)ai, (B).; — 3(B):;, and 
(AB?)., — 3(AB2),, yields the corrections for adjusting the treatment totals. These 
are entered in the last row of both parts of table XI-15 and in the last column of the 
first part of the table. The adjusted total for any treatment is obtained by adding 

ree correction terms to the treatment total, one for each of the three arrangements. 
The adjusted mean for treatment 12 is 


8 + 0.36 + 0.44 — 0.07] = 2.91. 


Тһе remaining adjusted means are obtained similarly and are given, together with 
the sum of the adjustments, in table XI-17. The sum of the adjustments equals zero 


within rounding errors. 


TABLE X117, Adjusted totals and means for the triple lattice experiment in table 
XI-14 


Sun of Adjusted 
adjustments 


totals 


The average effective error variance for the triple lattice design is 

9 
3 3 pee ЗЕ = 0.71531 + qoom 
RE apum emper js 4 
КЕТ + 3w | z perd | 
m. (X143) 
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The efficiency of the triple lattice relative to a randomized complete block 
design is 
0.832 ^ 111 per cent. 


"The average standard error of a difference between any two adjusted treatment 
means is 


Б-у - Virol Fors] 


= 0745. (XI-44) 


The standard error of a mean difference between two treatments appearing to- 
gether in an incomplete block is 


x Ea + 2y) = 4 Ponsa + 2(.073)] = 0.74. (XI-45) 


The standard error of a mean difference between two treatments not appearing 
together in an incomplete block is 


A Esa + 3p) = 4 Рет + 3(.073)] = 0.76. (XI-46) 


For more than one set of the triple lattice, the analytical procedure follows that 
outlined for the double lattice with slight alterations. 


XI-3.3 BALANCED LATTICE 


The balanced lattice design requires that all pseudo-effects be confounded 
equally in the set of replicates chosen. If one pseudo-effect or interaction is 
confounded in each replicate, it requires k + 1 replicates for the balanced 
lattice. Balanced lattices are available for all prime numbers or powers of 
prime numbers (for example, k = 3, 4, 5, Т, 8, 9, 11, etc.) as well as for some 
other integers. 


А The general form of the analysis of variance for q sets of a balanced lattice 
with k? treatments is 


A————— o c mue 
Mean square 


Source of variation df 
Observed Expected value 
Replicate qk -1-1 E E 
"Treatment (ignoring block) k:—1 — == 
Block (eliminating treatment) 2-1 pre D yep 
q(k ) E, с? + аф + 1) ep 
Component (a) (а — 1)(k? — 1) = с + Ков? 
C 2 
'omponent (b) (k? — 1) — cë + Ss ав? 
Intrablock (k* — 1)(qk — 1) Е, с 
"Total qk(k 4-1) — 1 = 


ar Т o Mc lC ы, С, ee eee 
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Г E XI-4. The illustrative example chosen is for А? = 3? treatments in 
оса се ана Effect, A is confounded with incomplete block differences in 
n es c т: replicate I, AB? in replicate II, and AB in replicate IV. The first 
for a es ede table ХІ-14 and one additional replicate represent the example 
Eon Es d (table XI-18). The 3? balanced lattice with four replicates is 
Blk: Ek 1 uple lattice, but the analysis may be somewhat shortened owing to the 
COM eature of the design. The randomization plan follows that for the triple 


TABLE А 
LE ХІ-18. Yields and field arrangement for а balanced lattice with 3? varieties 
in four replicates (Treatment numbers in parentheses) 
Replicate I (Y arrangement) Replicate total 


(оо) (20) (10) | (c) (22) (22) | (21) (22) (01) 
8 5 3 ee 6 1 
ho =X 


Block t 
otal (в), = 16 (B) = 1 (8), = 5 s 


Replicate II (Z arrangement) 
(or) (ео) Q2) | (22) (22) (90) Q0) (22) (ce) 
1 


Bl 
Ock total (ава) = 9 [CN = 20 (АВ), = 6) 38 2X. 
Replicate III (X arrangement) 
(21) (го) (22) Qo) Qi) Q2) | (01) (0) (00) 
Block tot, LEET 2 забез gin its ИБ | 
а1 (Assn (79 (A), = 2 4X. 
Replicate IV — (W errangerent 
Bloc 4 
k total (AB) ы = 15 (53) = 10 (ав) = 15 36 =X, 


TABLE 
LE XL-19, Total yields and other totals required for the analysis of the balanced 
lattice in table XI-18, adjusted totals and adjusted means 


Adjusted Adjusted 
KV +G treatment | treatment 
= (к + 1)58 = М 1 
о al . 


As wi 3 
th With other latti 1-19) is required. 
е er lattices a table of totals (table X )i 15) for hado bis aud 


alan 
triple la Ced lattice is different from those (tables XI-4 and XI- 
ttice desi d be used if desired. The second 
igns, although a comparable table coul S ird сой corte 


of table XI-19 contains the treatment totals. 
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block totals in which a treatment appears in the k + 1 = 4 replicates. Thus, treat- 
ment 20 appears, by chance, in the first incomplete block in replicates I, II, ПІ, and 
IV, and the corresponding total of the incomplete blocks is 16 + 9 + 11 + 13 = 49. 
The fourth column is obtained by adding k times the treatment total (= ЕУ) to the 


TABLE XI-20. Analysis of variance 


Replicate 
Treatzent (ignoring block) 
Error (r. c. b.) 


Block (eliminating treatment) 
Intrablock 


grand total (=G) and subtracting k + 1 times the sum of the block totals in which the 
treatment appears (=(k + 1)SB). For treatment 01 this quantity is 


3(10) + 146 — (3 + 1)44 = 0. 


After performing the computations in columns 2, 3, and 4 of table XI-19, the analysis 
of variance for the balanced lattice may be computed (table X1-20). The replicate, 
treatment, and total sums of squares are computed in the usual manner. The sum of 
squares of the kV + G — (k + 1)SB = W quantities, divided by (Б + 1), 


Хит 1024 _ 
ВЕ 1) = 108 ~ 949 


yields the sum of squares for block (eliminating treatment) with r(k — 1) = (k + 1) 
(k-1)=#-1=8 degrees of freedom. The intrablock error sum of squares is 
obtained by subtracting the block (eliminating treatment) sum of squares from the 
randomized complete block error sum of squares, 19.11 — 9.48 — 9.63, with (k? — 1) 
(= 1) – (Е – 1). = рз k + 1 = 16 degrees of freedom. 

The weighting factor His 


(XI-47) 


w—w Е-Е. 1.185 — .602 


i = = = -48 
t= Nkw- w) ~ EE, $1185 = 0T. (X148) 


The adjusted totals are obtained by adding pW to the unadjusted total, or by the 
method described in Chapter IX. The adjusted total for treatment 21 is 9 — .60 = 
8.40, and the adjusted mean is (9 — .60)/4 = 2.10. 

The average effective error mean square is 


El + ku) = 0.602[1 + 3(.0547)] = .701. (Х1-49) 
TThe efficiency of the balanced lattice relative to the randomized complete block 


gn is the ratio in per cent of the two average effective error variances, .796/.701 
— 114 per cent. 
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'The standard error of a mean difference between any two adjusted treatment 


neans is 
2 k+1 \/ 2E, E NERO ы Ў 
м (з) 7 Veer t Меат ЫЙ GOD 


АП treatment. comparisons are of equal accuracy, since each treatment is compared 
once and only once with every other treatment in an incomplete block. Hence, only 
à single standard error of a mean difference is required. This is a feature common to 
balanced lattice designs. 

The coefficient of variation is computed similarly to that for all lattices; i.e., the 
Square root of the average effective error variance is divided by the mean of the 


experiment: ne 
101 


146/36 

As with other lattice designs, several checks are available in computing the totals 
of table XI-19. The sum of column 2 equals the grand total. The sum of column 3 
equals k times the grand total, 3(146) — 438. The sum in both columns 4 and 5 is 
equal to zero. The sum of column 6 is equal to the grand total, within rounding errors, 
The sum of column 7 is equal to the grand total divided by the number of replicates, 
Within rounding errors. 
XI-3.4 RECTANGULAR LATTICES 

The two-dimensional one-restrictional lattice design composed of kp 
treatments is known as the rectangular lattice. This class of designs was 
first proposed by Yates [323, 324], and some of these designs are included in 
the general class of designs discussed by Bose and Nair [30]. The designs 
Proposed by Yates had incomplete blocks of size k in one arrangement ч 
Size p in the second arrangement. The analysis and discussion of the particu 7 
designs for k(k + 1) treatments in incomplete blocks of k treatments $ a 
arrangements are given by Harshbarger [148-50]. The analytical e s 
for the designs discussed by Harshbarger is only slightly more complex than 
that for lattice designs with k? treatments in incomplete blocks of k treat- 
ments. Since this is true and since k(k + 1) treatments as well H treat- 
ments may be put into incomplete blocks of size k, this class of designs 13 


an import Д í 

There Es x methods for constructing (Е + 1) rectangular ы 
[60, 148-50, 229. 266]; the method of orthogonal latin squares is the one 
illustrated below. The first step is to select the p — 1 orthogonal p a p nee 
Squares of side p = k + 1 [129]. From these squares, ES Е. а= cr 
Sonal squares for which the leading diagonal can be ordered numerically. 


i are 
Ог example, these squares for the 3 X 3 and 4 X 4 latin squares аг 
4 X 4 squares 


= 2] per cent. 


3 X 3 square 
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If the letter a is used to denote the column number and the letter b to denote 
the row number, if the squares are combined, and if the leading diagonal is 
omitted, the following arrangements for the 3 X 3 and 4 X 4 squares are 


obtained: 


а) 
= арс; 


(3) 
або; = 


(5) (6) 
bsc; ajb:co 


(2) 
азс 


(4) 
абсо 


(4) 
агрус: 


(7) 
Bob;cid; 


(10) 
Bobsc;d, 


(1) 
арос, 


(8) 
arbzcsdo 


(11) 
arbscod2 


a:bocad; 


acbicods 


(3) 
asbocida 


(6) 
asbic;do 


(9) 
asbscod; 


азад 


The next step is to select а letter and to assign identical subscripts to an 
incomplete block. Thus, for the 3 X 2 rectangular lattice the following three 
arrangements are obtained: 


Each group is assi 
following four arr 


X 


а) (8) (11) 


— — NN 


(2) (5) (12) 


Á 


(3) (6) (9) 


——— 


(4) (7) (10) 


(10) (11) (12) 


Ww 


(1) (6 (10) 


_—— 


@ (4 (в) 


_—————— 


(3) (7) (12) 


n 


(5) (9) (11) 


gned to а replicate. For the 3 X 4 rectangular lattice the 
angements are obtained: 


(1) (5) 0 


——— | 


(2) (9) (10) 


== 


(3) (4) GD 


к ——” 


(6 (8) (12) 
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The same procedure тау be followed for constructing other rectangular 
lattices. A number of these arrangements have been tabulated by Robinson 
and Watson [259]. 

To obtain a simple rectangular lattice, any two (usually the first two) of 
the k + 1 arrangements may be selected. Likewise, for a triple rectangular 
lattice any three arrangements may be used; the first three are ordinarily 
selected. If k + 1 groups are used, the design is called a near balance rectangu- 
lar lattice [150]. 

i The general form of the analysis of variance for the k(k + 1) rectangular 
attice is 


Source of variation df Expected value of ms 
Replicate пда—1=г—1 23 
Treatment (ignoring block) kk+)-1 — 

r—1 
Block (eliminating treatment) tk сё кой 
Component (a) nk(q — 1) RE 
n (b) ER xS 
Intrablock error rk? —r—k?-k+1 oe 


Total rk(k 4-1) — 1 


Where q — number of repetitions of the basic plan ofn arrangements, compo- 
nent (а) represents the sum of squares due to the interaction of confounded 


effects and replicates in which effects are confounded, and component (b) 


represents the comparison between the groups of treatments in replicates in 
e treatments in the replicates in 


Which they are unconfounded with the sam men: i 
which the group comparisons are confounded with incomplete block dif- 


ferences. ; 
The weighting factors for rectangular lattices with k(k + 1) treatments 


are of the form, 


(E, — E) 1-51) 
EU 
NEGENDE TE, +[& +1) — Ё/4]Ё, x 
and 
x (X1-52) 


к= 3X | 

The approximate average standard error of а difference between two adjusted 
Means is [60, 259]: 

k?—(n—1)(k—1) 


2E, = а 1--m —21. 
deae e- D^ »)+( E-k-—1 X а 2 


ame as for other designs (Е = block 
E, = intrablock error mean 


The symbols used above are tbe 8 1 
(eliminating treatment effect) mean squares 
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square; п = number of arrangements in the basic plan; д = number of repeti- 
tions of the basic plan, qn — r — total number of replicates; and k(k + 1) 
= the number of treatments). Use is made of the weighting factors to obtain 
weights for adjusting the treatment means. The analysis and the adjustment 
of treatment totals are illustrated below with an example of a triple rectangular 
lattice design. The randomization procedure follows that for the simple, triple, 
etc. lattices described above. 


Example ХІ-5. The 2 X 3 triple rectangular lattice example in table XI-21 was 
prepared by omitting treatments 02, 1l, and 20 from the experiment described in 
example XI-3. The notation aj, bj, and cj follows the method of constructing the three 
arrangements for the 2 X 3 triple rectangular lattice described above. 

Robinson and Watson [259] describe the analysis in detail for the simple and 
triple rectangular lattices. Their procedure and notation are followed to a considerable 
extent in setting up table XI-21 and in obtaining the analysis of variance table. The 
construction of the various totals and differences and the randomized complete block 
analysis are straightforward. The sum of squares for the component (b) equals the 
block (eliminating treatment effect) sum of squares, since there is no component, (a) 
sum of squares; this is equal to 


p (Qs QOO — Q2 0,24 Q,2 = 70.2 
ј20 r(nk— k — 1) r(k + D)(nk — k — 1) r(n — D(k-- D(nk—k— 1) 
(XI-54) 


(-2*-(-39'*---60 (-9'-(-)*t9 (4) (т) фз 
7 9 meee crore re = RU 7 


The intrablock error sum of squares is equal to 4.55 — 3.89 = 0.66, with 10 — 3(2) 
= 4 degrees of freedom. 


The weighting factors are found to be 


_ L648 — 165] _ 493 _ 
A = 30648) F 165 = 5109 = 0-229 
and 


X 
п = 13x = .031. 


The average standard error as obtained from formula (XI-53) is equal to 


pa 
2165131 + 2(.229) – 031] + n + 3(.229) — озу] = 41. 


Тһе Average effective error variance is obtained by squaring formula (Х1-53) and 
multiplying the result by r/2. For this example the average effective error variance 
equals (.165)(1.5124) = .2495. The coefficient of variation in per cent is equal to the 


Square root of the average effective error variance divided by the mean of the experi- 
М .2495 


74/18 Х 100 = 12 per cent. The efficiency of this rectangular lattice design 
relative to the corresponding randomized complete block design is the ratio of the 
two average effective error variances expressed in per cent; thus, .455 X 100/.2495 
= 182 per cent, ora gain in efficiency of 82 per cent. The above efficiency has not been 


ment: 


TABLE XI-21. Numerical example of a 2 X 3 triple rectangular lattice 


Replicate I 
(X arrangement) 


Block по.) 


Yields 


GQ) 8 (2) 5 
(5) 2 (6) 6 
(4) 5 (5) 5 


% 
ba 
b 


"gs icate II Block no. 
X arrangement) 


— III | Block no. 
(2 arrangement) 


(1) 22 (2) 
(5) 8 


(5) 3 (5) 5 
(6) т (1) 8 
(2) 5 (4) 2 


Yields 


(3) 2 (1) 6 
(2) 5 (5) 5 
(4) 2 (6) 7 


9 
WNT 
8 (6) 20 


(5) 


Block subscript 


Replicate 
Treatment (ignoring block) 
Error (r. c. b.) 


Block (eliminating treatment) 
Intrablock 


Total 
Correction for mean 


1.374 
1.145 


.916 
-.229 


Total 22.229 8.5h2 


2 
Mean TA 2.8 


23 
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TABLE XI-22. Experimental arrangement of yields (pounds of ear corn) for 3* treatments 
in blocks of 3 units in 3 replicates 


Replicate I Replicate II Replicate III 


(112) (122) (110) 
30.6 32.0 30.3 2.9 


(001) 
25.0 


(201) 
51.0 


(011) 
3. 


(221) (211) 
29.2 


(100) (200) 
30.8 "31.3 30.6 92.1 
(210) (010) (110) 

26.5 30.4 51.8 88.7 


(oœ) (000) (001) 
29.9 31.6 32.5 94.0 
(210) (211) (212) 
22.5 30.6 29.5 2.6 
(120) (122) (121) 
31.0 27.9 50.0 88.9 
(100) (102) (101) 
32.6 30.6 32.9 96.1 
(011) (010) (012) 
29.7 34.0 32.7 96.4 


(220) (221) (222) 
lel 35:0 55.1 98.2 
(ого) (021) (022) 

32.5 32.6 55.1 100.0 
(202) (201) (200) 

53.0 Sle 512 96.4 


Replicate total 855.5 


Grand total 


(120) 
29.9 
(20) 
29.2 


(210) (100) 
30.7 
(12) 
27.7 


(oœ) (20) 
24.2 29.4 29.5 80.9 


(ого) (120) 
26.7 25.2 28.6 80.5 


(202) 
27.4 


(212) 
28.8 


(201) (101) 
25.9 26.8 24.4 71.1 


(221) (221) 
28.1 28.7 28.8 85.6 


(ce0) (ооо) 
31.0 28.5 28.8 88.5 


(012) 
31.5 
(201); 
55. 
(гоо) 
53.1 


(oce) 
35.9 
(21) 
33-7 


(212) (12) 
51.5 29.9 
(222) (022) 
32.5 33.0 33.1 98.6 
(011) 
31.6 29.5 30.8 91.7 


(210) 
29.5 


Various arrangements are then obtained. For example, the sum of the totals for treat- 
ments 000, 100, and 200 is 90.7 + 93.0 + 95.6 = 279.3. These three treatments 
appear together in the first incomplete block in replicate III and have a block total 
© 92.1 = 30.2 + 31.3 + 30.6. The latter values are denoted as the a values, while 
the former are called the d values in conformance with the notation used by Yates 
[326] and by Homeyer et al. [160]. Table XI-23 conforms to table 56 of the latter 
E The e, b, f, and c values are obtained in a manner similar to that for obtaining 

e d and a values. The j, m, and l values are computed in much the same manner 85 


the (4).; —.3(A)s; values in table XI-15; thus: 
m — e — 3b, (XI-55) 
A ј = 4 – За, (1-56) 
T p (XI-57) 


The other totals and differences are obtained as indicated in table XI-23. 
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Р With the quantities now available in tables XI-22 and X1-23 the sums of squares 
in the analysis of variance may be computed. The sums of squares for the total, treat- 
ment, replicate, and randomized complete block error are computed in the usual man- 
ner. For a cubic lattic analysis the randomized complete block error is partitioned into 
the block (eliminating treatment effects) and the intrablock error sums of squares. 
The former sum of squares is obtained as the sum of three sums of squares; viz., 
(i) component (a) — sum of the interaction sums of squares for replicates and effects 
(A, B, and C) confounded in more than one replicate, (ii) component (b) — sum of 
Ee for the comparisons involving the level of the pseudo-effect (A, B, and C) con- 
ended in two replicates with the levels of the pseudo-effect in the replicate in which 
the effect is unconfounded, and (iii) component (c) = sum of squares of the contrasts 
ine the level of the pseudo-effect confounded in one replicate (AB, AB’, AC, 

С?, BC, and BC?) with the effect in the other two replicates in which the effect is 
unconfounded. The sums of squares described in (i), (ii), and (iii) above are comput- 
able in this manner only if k = a prime number or power of a prime number. If k = 
any integer the method described below is appropriate. 

The component (a) sum of squares is 


D(A A В”): 4 Y = Q4 (С Ө A)? A (4"— В”)? + (В" DA с") + (C2 = A")? 
245 


2gk* 
— 1568.84 + 464.26 + 1797.48 1521.00 + 806.56 + 4542.76 
> z . P D . : XE ү 5 
2(1)(9) = 18 51 85.582, (Х1-58) 


with 2 + 2 + 2 = 6 degrees of freedom. 
The component (b) sum of squares is 
BIE м E(M+ LEL)? t^ (QJ 4- M^ + (M" + Ly: + (LY + Jy 
[7 бак 
= 3150.92 + 4788.94 + 717.48 _ 9177.64 + 11320.96 + 112.36 
= d 48 — р GL —— = 33.093, XI-59 
6(1)(9) = 54 162 : STEM 
With 2 + 2 + 2 = 6 degrees of freedom. 
The component (c) sum of squares is 


LP + Ут + OP, yet ey ШУ 
6qk 6g? 


ул Duy мъ ОМ) + УЛА VEY (е 
бок? ) 


__ 3601.50 + 1000 + 1306.78 | 197.998 


_ 3987.06 + 5167.86 + 1006.78 + 2648.18 + 3279.54 + 3476.86 
54 


= 107.768, with 6(2) = 12 degrees of freedom. 


TABLE XI-23. Computational sheet for 3! treatments in blocks of 3 with 3 replicates 


X and Y arrangements 


(a) (a) (3) (6,7) 


95.2 2.8 3 
(ceo) (220) (220) 
90.0 86.1 90.9 


820.6 (1) 261.5 (A) 36.7 (3) 
272.7 В) -11.4 (А - B 
2.5 (М' 39.2 (J + Mt 
0.855 (Y) 


89.9 95.1 Tl 42.0 
(011 (21) (211) 
95 9%.5 91.9 219.8 91.7 4.7 1.099 


85.6 21.4 3.058 


831.3 (4) 254.4 (A) 68.1 (7) 
291.2 " =36.8 (A - В!) 
= 42.3 (м! 25.8 (J + Mt) 


0.548 (ү) 


B 265.1 215.5 818.8 (1) 272:4 (А 1.6 (д 
i) 96.0 8.2 85.0 265.2 il 9.2 ne в! u 
(m ; - 7.6 12.5 2h.5 29.2 (M' 30.8 (J + Mt 
(ey. ) -2.196 0.950 ' 2.11 0.654 (y) 


g 82.1 818.4 800.2 2470.7 88.1 (A" 106.4 (J" 
R 284.8 275.8 266.5 827.1 s) Бб i». в" Sin 
22.5 -9.0 20.7 - 10.6 (M" 95.8 (J" + a 
А! 252.1 271.3 264.7 788.1 (A" 
Pad m ЕДЕ: 

25. ў i = 28.4 (в" = с" 
М + H) =61:} 1 20:7 -106.4 " * i) 
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TABLE XI-23 (continued) 


Z arrengement 


(000) (100) (200) 
90.7 95.0 95.6 
(001) (101) (201) 
89.9 95.1 3 
(осо) (20) (202) 
88.0 89.2 90-5 
(г) 268.6 t WA 820.5 (h) 
(e) 9.0 ^l CEN 286.5 (C) 34.4 (С - Ағ) 
(1) -13.4 -11.0 -14.8 -39.2 (L) 24.8 (L+ J") 
(с;') = 1.045 - 0.165 - 1.209 0.527 (B) 
(010) (110) (210) 
95. %.8 3 
(011) (111) (211) 
94.5 9 
(012) (112) (212) 
95.7 88.2 9.8 
(г) 80. i H 825.8 (h) 
(c) Ss EE per: ARD) (c) 10.6 ic -A') 
(1) - 8.9 - 3.2 - 7.8 -19.9 (L) -8.0 (L + J) 
(су  -i1& 0.555 = 1:085 -0.170 (P) 
(сео) (120) (220) 
90.0 86.1 90.9 
(01) (221) (221) 
5 91.8 91.9 
(cea) (222) (222) 
98.7 87.7 95.0 
(©) 283.2 265.6 275.8 4.6 (n) 
(с) Ч f E ieri 1 (с) 22.4 (C = AT) 
(1) 2050 p cle I RT 
(су  -eano  -o2à0 - 2:557 -0.12 (P) 
(в) 832.1 818.4 820.2 2470.7 
(cry { Ў 2 855.5 (c") 67-4 (С - А") 
291 53 us -95.8 (L") 20.6 (L" + J") 


353 
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the adjusted mean is 96.832/3 = 32.28. The other adjusted totals and means (table 
ХІ-25) are similarly obtained. 


TABLE XI-25. Unadjusted and adjusted treatment totals and adjusted means for 3° 
treatments in 3 replicates 


Treatment Treatment Totals 
number . * Р number Unadj. Adj. 


88.2 89.2 
86.1 89.088 
91.8 91.617 
87.7 87.119 
95.6 93.132 
88.5 92.408 
90.5 92.585 
88.5 87.866 
91.9 91.756 
89.8 91.665 
90.9 91.255 
91.9 9% 463 
93.0 91.804 
2470.7 2470.700 


(ii) Analysis as a factorial (k = a prime number or power of a prime number) 


The analysis of a three-dimensional lattice design in blocks of k as a factorial 
experiment follows immediately from the results in Chapters VII and IX. The various 
steps are described in detail by Federer [103] and are sketched below. From the data 
of table XI-22 the levels of each pseudo-effect are obtained in each replicate (table 
XI-26).! The next step is to compute the sum of squares among levels of effects in each 
replicate (table XI-27). The sum of the sums of squares for effects within replicates, 
377.96, is equal to the within replicate sum of squares, 462.74 — 84.82 = 377.92, 
within rounding errors. 

The component (a) sum of squares is obtained by subtracting the sum of squares 
for levels of effects in the replicates in which the effect is confounded from the within- 
replicate sum of squares for these replicates. For example, the interaction sum of 
squares of levels of effect A with replicates I and II is 


931 4 18.61 {(290.4 + 284.8)? + a) + EIE + (287.2 + 266.5)? 


_ (855.5 + 827.1)? 
teat = 9.37 + 18.61 — 17.12 = 10.86. 


The interaction sums of squares for B and C are computed similarly. 


The component (b) sum of Squares represents the contrast of the level of the effect 


'It is not necessary to compute all effects tial f ШОН 
(eliminating block) sum of equates. less one wishes to compute the treat: 


L 


TABLE XI-26. Totals per level of effect in each replicate and weighted levels 


Replicate It 


Replicate XI 
Effect level of effect 


Replicate in 
Level of effect Level of effect Weighted levels of effects 
1 1 о 2 
(290-4) (277.9) (287.2) (284.8) (275.8) 256.9 264.7 266.5 266.252 
(286.5) (281.9) (287.1) 281.7 272.6 (252.1) (271.5) (264.7) 277.%5 
sche 274.1 276.7 


254.6 266.0 267.5 265.451 
281.7 272.7 2771.3 262.1 264.5 269.147 


285.2 IRE л) n 2 280.629 
287.4 262.4) (274.5 275.170 


278.7 (272.1) (276.7 


Ё 


РРРРРРЕН 


Sg 


261.5 
(261.3) (254.4) (272.4) 


265.1 265.6 258.8 


85885 


T 
1 . 

261.5 260.1 260.7 188. 277.643 
281.0 275.8 272.9 eee oa 20) 188. 279.145 
284.0 272.5 275.3 282.4) (257.2) (218.5) 788. 278.479 
2ТТ.Т 271.0 272.3 259.8 262.8 788. 272.767 
285.5 272-4 276.0 259.3 265.9 788.1 | 274.300 
285.9 278.1 270.6 268.2 261.5 788.1 | 275.200 
289.9 274.8 278.0 269.5 253-9 264.7 788.1 | 215.955 271.333 


"Parentheses signify the effect confounded with incomplete block differences 


€ 


TABLE XI-27. Sums of squares for the effects within each of the 3 replicates and for components (a), (b), and (c) 


Replicate 
E 1.50 45.88 QA Re 4.34 

: : 2.30 1. 2.91 11.05 2.22 4.3 0.90 
ІІ 5.79 21.158 11.06 0.56 18.55 2.78 5.55 10.61 68.89 1.81 2.43 5.45 123.19 


Component (c) sum of squares 


(Comparison by levels of mean unconfounded effect 
in 2 replicates with effect confounded in only 
one replicate) 


Component (b) sum cf squares 


Component (a) sum of squares 


(Interaction of levels of an effect 
with replicates in which it is 
confounded) 


(Comparison by levels for mean unconfounded 
effect in 2 replicates and mean confounded 
effect in other replicates) 


Sum of squares Sum of squares аг 

A 9.37 + 18.61 - 17.12 = 10.86 2 | А 69.814 + 10.955 + 7.955 - 69.882 = 18.800 2 | АВ 9.209 + 27.735 + 22.427 - 56.652 = 2.719 2 
B 1.80+21.15- 7.21 = 15.70 2 | B 11.590 + 1.185 + 0.712 - 0.69% = 12.595 2 | AB? 23.207 + 23.470 + 11.390 ~ 56.652 = 1.415 2 
C 45.06 + 18.55 - 4.39 = 59.00 2 | C 28.456 + 12.327 + 17.567 - 56.652 = 1.698 2 | AC, 10.518 + 0.000 + 22.170 - 0.694 = 32.054 2 
AC? 2.802 + 0.600 + 5.479- 0.694 = 8.189 2 

ВС 10.140 + 21.156 + 44.627 - 69.882 = 6.241 2 

BC? 1.402 + 43.740 + 81.89% - 69.882 = 57.154 2 

107.772 12 


Sun 


85.60 6 
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in the replicate in which the effect is unconfounded with the level of the effect in the 
other two replicates. For example, the sum of squares for this contrast for pseudo- 
effect A is 
[290.4 + 284.8 — 2(256.9)]? + [277.9 + 275.8 — 2(264.7)]? 
k(1+1+4) = 6k = 54 


, [287.2 + 266.5 — 2(266.5)]? _ [855.5 + 827.1 — 2(788.1) 
54 6E = 162 


= 69.814 + 10.935 + 7.935 — 69.882 = 18.802. 


The component (c) sum of squares is obtained from similar computations. The various 
Sums of squares agree, within rounding errors, with those obtained previously (table 
ХІ-24). The standard errors are computed as before. 

„То obtain the adjusted totals, it is first necessary to complete table XI-26. The 
weighted levels of the effects are obtained by weighting the level of an effect inversely 
to the variance with which it is estimated. The weight for a level of an effect in a 
Троі in which the effect is unconfounded with block differences is ш; the weight 
Ог the effect in the replicate in which the effect is confounded is w'. The weighted 
value for effect (А), is 

[CA)1o + (A)so]w' + w(A)s0 

2w + w 
_ (290.4 + 284.8) (.078398) + -357910(256.9) _ 266,252.  (X1-18) 
.514706 
As other weighted levels of the pseudo-effects are obtained in the same manner. The 
Е Justed totals are obtained in the manner described in Chapter IX. The formula for 
taining the adjusted treatment total for А treatments is 
Хау = zl (A); + (В); + (АВ); + (АВ?) + (С) + (AO + (АС?) з» 
+ (ВО), + (ВС) даь + (АВО) нь + (АВС?) наь + (ABO nion 

(ХІ-79) 


+ ABC) E — rk( + DR 
effects are those in the right-hand 
otal for treatment, 101 is equal to 
0 + 274.808 + 277.643 + 279.632 


V = = the experimental mean. The levels of the 
268.4 ^ table XI-26. For example, the айе 

“01 + 277.923 .260 + 272.657 + 281. 
278.468 + 214767 4 214300 + 275.200 + 275.933) — 36(2470.7)/3(27) = 96.832. 
S Temaining adjusted totals are obtained in the same manner. To facilitate the 
Calculation of adjusted totals, table XJ-28 has been prepared. The levels of the effects 
“te indicated for each treatment. (Table ХІ-28 is useful for all 33 lattices regardless 


of _ 7 36(2470.7) _ seule 
the scheme of confounding.) The figure 362 (= зои) = 1098.0889) is sub 


third of the sum of the appropriate levels of the effects to 


tracted f 
4 Tom r/k? = 

obtain одеа in table ХІ-28. 

va column аа о ова ЫЕ Hoe effect) may be obtained as 


е sum t (elimin: ained c 
described ae Side dcos are computed only from the E 
ч Шеу аге "кы with incomplete block differences. The blocks-within- 


TABLE XI-28. Levels of weighted effects from table XI-26 associated with any treatment; adjusted totals (36x = 1098.0889) 


Treat- (А), (8), (Ав), у (AB) 405 (c), (AC) 4, (АС), о (вс), (8c), ш, (авс), у (АВС) ion (авс), (aBC?) озом 
ment 


PRR ООО NNN ооо NANN PRE vv» HEE ooo 
ооо PRR mmo mmi ооо HEH HEH огоо ооо 
MRO NPO NPO NEO NPO NPO NEO ьо юро 
POM FON PON ONP ONE ONP NPO оно Nepo 
OrN OFM OPN mop Nor NOK HNO PNO ьн оо 
"OM ONP NPO FON ONPE NPO PON ovp оно 
OFN VOH HNO OPN NOP PNO OPN NOH PNO 
ONE оно HOM оно ною ONP HON ONP оро 
Or коо OPN pmo OPN mom OPN NOP PNO 
NPO ONP ною PON NHO ONH ONE WON юно 
pmo оон OFM оҥю HNO Nor оон OPN HNO 


(9) 
о 
о 
(0) 
(o 
о 
о 
o 
o 
l 
l 
l 
1 
1 
1 
1 
1 
1 
2 
2 
2 
2 
2 
2 
2 
2 
2 


PNY PE COG PER HEE OOO PNM BEE ooo 
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replicates — block (ignoring treatment effect) sum of squares is obtained as the sum 
of squares among incomplete block totals within replicates. Other methods for com- 
puting the treatment (eliminating block effect) are discussed by Kempthorne [175], 
Nair [221], and Rao [255]. 

. . The intrablock error sum of squares may be obtained directly as the sum of the 
Interaction sums of squares of levels of the pseudo-effects with the replicates in which 
they are unconfounded with incomplete block differences. For the 3* cubic lattice 
with three replicates the sum of the degrees of freedom for the interaction sums of 
Squares for levels of main pseudo-effects with replicates is equal to zero, for levels of 
the two-factor interactions and replicates = 6(2), and for levels of the three-factor 
Interactions and replicates = 4(3 — 1)(3 — 1) = 16. Therefore, the total number of 
degrees of freedom associated with the intrablock error sum of squares is 


0+ 12 + 16 = 28. 


(iii) Rao's general method of analysis 


Some additional concepts of association among varieties in the incomplete blocks 
of a cubic lattice design are necessary in order to set up the various systems of parame- 
lers in accordance with Rao's [255] general method of analysis. The purpose of this 
Section is to set forth these concepts and to give the parameters peculiar to the cubic 
lattice. "There are two ma jor kinds of association among treatments in the incomplete 
blocks; viz., those treatments associated together in an incomplete block, or first asso- 
Clates, and those not associated in an incomplete block, or zeroth associates. In three- 
dimensional designs, there may be two or more kinds of first associates as well as two 
9r more kinds of zeroth associates. In the cubic lattice design, there are two kinds of 
zeroth associates and one kind of first associates. Any specific treatment will have 
3(k — 1) first associates; i.e., any treatment appears with m = 3(k — 1) other treat- 
ments in the three incomplete blocks in which it appears (figure XI-1). Consequently 
then, there are 2 — 1 — 3(k — 1) = E — 3k + 2 zeroth associates or k? — 3k + 2 
treatments with which the particular treatment is not associated in one or another of 
the incomplete blocks. One set of the zeroth associates is connected to a particular 
treatment through association in incomplete blocks with the first associates of the 
treatment. There are no = 3(k — 1)? zeroth associates of this nature. The remainder 
of the zeroth associates are not associated with the treatment in any way other than 
that the different incomplete blocks in which they occur are in the same replicate, 

here are nw = k — 3k + 2 — no = (k — 1) zeroth associates of this kind. 


For the cubic lattice design the first system of parameters is 


4 , = 0; noo = (k — 15; 
zeroth associates (first type): Аю = 0; no = (k 1 
zeroth associates (second type): Аш = 0; na = 3(k — 1)*; (X1-80) 


first associates (only one type): № = l; m = З(Е = 1); 
г = 3;0 = №; ш 3k; k — number of units per incomplete block. 


The second system of parameters for a cubic lattice design has to do with the 
number of treatments in common for the various types of association. For example, a 
given pair of treatments may have no subscript in a given position in common and 
consequently no common section on the cube, as pictured in figure XI-1. The second 
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system of parameters describing the types of association among a pair of treatments 
with no subscript in a given position in common, say 000 and 111, is 


(k — 2* 3(k — 2)? 3(k — 2) 
pi = | 3(k — 2)? 6(k — 2) 3 (XI-81) 
3(k — 2) 3 0 


The second system of parameters describing the association among a pair of 
* treatments with one subscript in a given position in common, say 000 and 011, is 


(k — 1)(k — 2? 2(k — 1)(k — 2) (k — 1) 
pj -(2&—-06—-2 (6-23*426—1 26-2 | (Х1-82) 


(k — 1) 2(k — 2) 2 : 

The types of association among associates of a pair of treatments which are first 
associates (i.e., compared together in one of the incomplete blocks, or which have two 
subscripts in common and consequently two common sections on a cube, say treat- 
ment 000 and treatment 001) are 


(k — 2)(k — 1)? (k — 1)? 0 
p?-|&-1* 2(k—2)(k —l)  2(k—1) (X1-83) 
0 2(k — 1) (k — 2) 


From the first and second system of parameters described above, it is possible to 
compute the various constants required for the analysis of a cubic lattice design with- 
out recovery of interblock information. Because of the misprints associated with B23 
and Bs; in Rao's original article the formulae for these two quantities are given below- 


Rao's subscripts on the A's, B's, and C's are retained. The constants in reduce 
form are 


Аз = 3k 2; Bi = 1; 

Аз = 0; Asz = 2(k — 1); 
Вз = 3(k — 1) + + (М — Yoo) (pu — рш) + (№ — №) (pi? — pis!) = 3;| (XI-99 
By = (№ = Yoo) (pi = pii) + (№ – doi) (P22 =: po) = 2k—5; 

Сз = —(k — 1); Сзз = 3k — 1; A = 602; 

F =2+k+1); G=—(k+4); Н = —(k+2). 


The variances (without recovery of interblock information) of the estimated dif- 
ferences of the different types are 


V(000-111) = POPTSETÓ 


бз = 1; 


(zeroth associates of the first type). (X185) 
PR + 3k + 4 
V(000-011) = 3g EA) (zeroth associates of the second type). (х1-80) 
Vi * i 2g*(k? + k +1 2hFg? 
SOS) 3p T == (first associates). (X187) 
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"The average variance of a mean difference is equal to 


a? (2E + 5k + 11) t 
кке (XL88) 


In an experiment, c? is replaced by Ze tlie estimated intrablock error variance. For 
q repetilions of the basic set of three groups, each of the above variances is multiplied 
by the factor 1/q. 
. The constants required for the analysis of a cubic lattice design with recovery of 
Interblock information are 
Зи’ 

В = Зи + р: Ajo = 0; An =0; А =ш-– ш; 
(3k — 2)w + 2ш"; Аз = 0; 
3[w + w'(k — 1)]; 
—(k — 1)(w — w^; 


Ais! 
Азз = 2(k — D)(w — ш); Ba 
Bz; = (w — w)2k — 5); C! 
Саз’ = (3k — lw + u^ 

F' = 9R2ww! + 2(w — w’)[w(k? + k +1) — w'(k — AE 

G' = —(w — w)((k + 4)w + 2w'(k — 2)]; 

H' = —(w — w')[w(k + 2) + 2w(k — 1)]; 

A’ = 3k w[2w? + 5ww' + 2ш7?]. 


(XI-89) 


The relative intrablock and interblock weights are w and w’, respectively, and are 
obtained from the cubic lattice experiment. The values R and Aj; replace r and №; in 
equation (XI-84) to obtain the A,;', Bij’, С, F', G’, and H' values. The variances of 
the differences of adjusted means when interblock information is utilized are equal to 


V(000-111) = 2ko*(F’ — С) А", (XI-90) 
V(000-011) = 2ke*(F' — H')/A, (XI-91) 
(XI-92) 


а V(000-001) = 2ke*F"/A". 


Nair [221] describes the adjustments of the treatment means both with and 


Without recovery of interblock information. 


Xt42 THRE 5 WITH MORE THAN THREE 
2 THREE- ENSIONAL LATTICES 3 THA! l 
ARRANGEMENTS (k =A PRIME NUMBER OR POWER OF A PRIME NUMBER) 


. The design and analysis for four or more arrangements of & treatments 
ìn incomplete blocks of А treatments has been described by Kempthorne and 
"ederer [177] and by Federer [103]. Since А must be a prime number or 
Power of a prime number, the method described in examples Х1-1-00) and 
AI-6-(ii), analysis as a factorial, may be used. The &? treatments are likened 
to the k? combinations in a factorial experiment, i 
are set equal to levels of the various effects. Various effects are confounded 
With incomplete blocks in various arrangements. In order to obtain balanced 
Confounding оп all k? + k +1 effects, it is necessary to have k КЕШ 1 
ürrangements (table XI-2 and section XI-2Y А partially balanced incomplete 
block design is obtained by selecting а subset of the k? + k АЎ | arrangements, 
or three arrangements, any three of the k? + k + 1 arrarfgéments Шау 
Selected, For four or more arrangements, it is necessary to &elzup a criterion 
ie Слз 


nt, and groups of treatments 


24 
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for the best selection of arrangements. The criterion used here is that all 
effects are confounded in as nearly an equal number of arrangements as is 
possible. There is no unique order for the arrangements, but the following 
order for 3? treatments is best for most, conditions and criteria: 


Arrangement Effects confounded 
1 А, B, AB, АВ? 
2 А, AG; AC? 
3 B, BC, BC? 
4 AB, BC?, AB'C? 
5 AB?, BC?, ABC 
6 AD?, BC, ABC: 
1 АВ, BC, АВС 
8 AB?, ABC, АВС? 
9 АВ, АВС, АВС? 
10 А, АВС, AB:C? 
п В, : ABC, AB:C 
12 B, AC?, АВС, АВС? 
13 А, BC?, ABC?, AB*C 


If three arrangements are desired, the first three are selected; if four arrange- 
ments are desired, the first four are selected; if five arrangements are desired, 
the first five are selected; etc. This allows the use of any number of replicates, 
whereas the design in section XI-1.1 requires 3q replicates. 

A detailed example is given by Federer [103] for &? = 27 treatments in 
incomplete blocks of k = 3 and for r = 4 replicates. The procedure is gen- 
eralized and may be extended for any number of arrangements. 


XI-5 n-Dimensional One-Restrictional Designs 


The general theory for prime-power lattice designs for k treatments in 
incomplete blocks of size k* (s < n) is given by Kempthorne and Federer 
[177, 178]. A few of these designs are discussed below. The randomization 
procedure follows that for the preceding designs. The treatments are randomly 
assigned to the treatment numbers, The groups of k* treatments are randomly 
allotted to the k»-« incomplete blocks. Then, the k* treatments in each in- 
complete block are randomly assigned to the experimental units. 


XL5.1 К“ TREATMENTS IN BLOCKS OF К* (5 < 4) 


Four-dimensional one-restrictional lattices may be designed in incomplet? 
blocks of size k?, E, or k with the incomplete blocks forming a complet? 
A The number of arrangements possible depends upon the value of k 
M blocks of size k? are used, the design falls into the class of gia 
k? are eel Se енор lattices. If incomplete blocks of size k or of x 
Moe Ps TE Ek-1 arrangements are possible for k a prime number 

and 10, at least four arrangements are possible. 


k^ x ie treatments in blocks of k. The general treatment for 
ents in incomplete blocks of size k with the treatments forming А 
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complete block with four or more replicates has been discussed by Federer 
[105]. Numerical examples for k = 3 and г = 4 and 6 are presented. The 
analysis follows that described in Chapter IX for confounded factorial ex- 
periments. If four replicates are to be used, one system of confounding of 
the pseudo-effects is 


Replicate I —4, B, C, and their interactions, 


Replicate II —A, B, D, *  * , 
Replicate III—A, C, D, *  * dM 
and Replicate IV—B, C, D, “ * 3 : 


The key-out of the degrees of freedom in the analysis of variance is 


EEE чо Шыл 0 0o 
n Expected 
Source of variation cf ms тае of ms 


Load o o QP— o нуына лыы аы а= 


Replicate 3 = at 
Treatment (ignoring block effect) (k* — 1) = = 
lock (elim. treatment effect) 4(k* — 1) Ep oe + 3kog?/4 
Component (a) 6k? — 4k — 2 — се + keg? 
К (Ь) 4(k — 1) Ш ор 
ү (с) 6(k — 1)? — od + keg?/2 
I Ё (d) 4(k = 1)3 = oê + 3kog?/4 
ntrablock error 3(kt — 1) — 4(k? — 1) Е, oe 
Total ht w E 


cO а оор оон з _ ВОДЕ бун һин ыы 


Тһе estimated weights аге w = 1/Е, and ш = 3/(4E, — Е,). Four standard 
errors [105] are available for comparisons of various pairs of treatments. For 
— 3 and r — 4 the average standard error of the difference between two 


adjusted means (see formula XI-2) is 


1 4 12 16 
m + 3ш $i 2w 4- 2w' + 3w + ш 


Four effects are confounded in three replicates; twelve effects are confounded 
11 two of the four replicates; sixteen effects are confounded in one of the four 
replicates; and eight effects are unconfounded with incomplete block dif- 
erences in any of the four replicates. The latter eight effects are subject toa 
Variance equal to 1/4w = Е./4. The adjusted totals are obtained in the same 
Manner as for confounded factorials with recovery of interblock information 
[Chapter IX; 105]. 

XI-5.1.2 kt treatments in blocks of k’. The class of four-dimensional 
ene-restrictional lattices in incomplete blocks of size k? is not very important 
rom a practical standpoint, since the block size becomes large for Ё 7 2. 

Ог k = 2, reference is made to Chapter IX for the design and analysis. 


8 
+ iat’ (X1-93) 
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ХІ-5.2 k* TREATMENTS IN BLOCKS OF К (s < 5) 

XI-5.2.1 А5 treatments in blocks of k. The design and analysis of 
five-dimensional one-restrictional lattices with an incomplete block of size 
k follow that outlined in section XI-5.1. The designs of practical importance 
are for Ё = 2 and 3. An illustration for Ё = 3 is provided by Kempthorne 
and Federer [177]. The minimum number of replicates necessary to obtain 
intrablock information on all comparisons is the integer equal to or next larger 
than the ratio n/s; thus, n/s — 5/1 — 5 for the present lattice. 


XI-5.2.2. k? treatments in blocks of k?. The lattices of this group 
are discussed in detail by Federer and Robson [114]. A numerical example 
with Ё = 2, systems of confounding for k = 2 and 3, experimental arrange- 
ments for-k = 2 and 3, and the general analyses are discussed in their paper. 
For k — 2 and 3, confounding systems required for balanced arrangements 
are investigated along with the types of confounding leading to designs ap- 
proaching equalization of confounding on the various treatment contrasts. 
The minimum number of replicates necessary to obtain intrablock informa- 
tion on all treatment contrasts is n/s = 5/2 = 2.5 or 3 replicates. 


XI-5.2.3 k’ treatments in blocks of k’. А specific example has been 
chosen to illustrate the design and analysis for a five-dimensional one-restric- 
tional lattice design in incomplete blocks of size °. It was desired to compare 
thirty-two treatments in a greenhouse experiment. The available number of 
greenhouse pots was 4(32) — 128. The experimenter (R. E. Clark, Cornell 
University) suspected that the variation among complete blocks would be 
larger than the within-block variation for some of the characters under study- 
Furthermore, for one of the important characteristics it was impractical to 
observe more than eight experimental units per day. He suspected that the 
day-to-day variation might be rather large in comparison with the within-day 
variation. For these particular conditions an incomplete block design of eight 
experimental units with the treatments in complete replicates was indicate¢ 
Since all treatment contrasts were of equal interest, the thirty-two treatments 
were randomly assigned to the treatment numbers for a 2° factorial; ©” 
ijhgf for i, j, h, g, = 0 or 1. The four arrangements given in table ХІ-29 were 
then constructed. The 2? — 4 groups of eight treatments each in each replicate 
were assigned to the four incomplete blocks (position and day) at random, an 
then the 8 — 2? treatments within a group were randomly assigned to the 
eight greenhouse pots. 

The key-out of the degrees of freedom and the expectation of the me? 
ШАШ are given in table Х1-29. The total, replicate, and treatment (ignor! ng 
rent aie Cr AS in the usual manner for a random 
gini aie eg e randomized complete block error sum of $90 dn 
intrablock. Both sums 5% Г ү азарае effec? the 

Squares may be computed directly, although 
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intrablock error sum of squares usually is obtained by subtraction. The block 
(eliminating treatment) sum of squares is the sum of the twelve sums of 
squares, each with a single degree of freedom, for the contrast of the mean 
level of an effect in the replicate in which the effect is confounded with the 
mean level of the effect in the replicates in which the effect is unconfounded. 
For example, A is confounded with incomplete block differences in group 1 but 


TABLE XI-29. Four arrangements for 25 treatments in blocks of 2* 
block no. Treatments in the incomplete block 


Group 1 (A, В, AB confounded) 


mw ю юы 


Group2 (C, D, CD confounded) 


er AU 


Group 5 (E, AC, ACE confounded) 


Source of variation 


Replicate 
Treatment (1gnoring block) 
Block (eliminating treatment) 


Intrablock 
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of freedom for E and the probability distribution for E remain unknown, 
exact tests of significance are not possible. 

Several approximate F tests are available for testing the equality of treat- 
ment means. For example, one could run a randomized complete block analysis 
and compute the following F test: 


E —. treatments (ignoring block) ms — 7 
о Гела (паа N randomized complete block error ms i? 


The degrees of freedom for both mean squares are known, but no use is made 
of the intrablock and interblock information. As a second F test, one could 
use the following: 


Bric} yc А 
Fiv — Land (o — 1)(r — 1) —b + raf] = treatments (eliminating block) ms. 


E. 
(Х1-98) 


Assuming additivity and normality of effects, F; is an exact test with known 
degrees of freedom. However, this test makes no use of the interblock infor- 
mation and is difficult to compute for some comparisons among the treatments. 
A third F test is 
Pin (sum of squares among adjusted treatment totals). (Х1-99) 
м r(k" — 1)(average effective error variance = E) 


The last test is useful in practice, but the number of degrees of freedom asso- 
ciated with E is unknown. Preliminary investigations indicate that the num- 
ber of degrees of freedom for the average effective error variance is approxi- 
mately the same as for E.. The test utilizes both intrablock and interblock 

- information, and the total treatment degrees of freedom and sum of squares 
may be partitioned into subsets. Cochran [48] provides an F test for double 
and triple lattice designs without stating precisely the number of degrees of 
freedom associated with the mean squares. It is implied from the application 
of the formulae that the numerator of F has y — 1 degrees of freedom and the 
denominator has the same number of degrees of freedom as E, [48, 60]. 
Cochran [48] proposes the following F test for lattice experiments with Г 
arrangements of the k? treatments in blocks of k units: 


(т — 1)(w — ш) w! 
Каас i zB. - (6 - 5E 
F- (т — uw t w (eta 0) (b — r) үйө 


where V, = treatment (ignoring block) sum of squares, В, = blocks within 
replicate sum of squares, b = number of incomplete blocks, г = number of 
replicates, and w and ш” are the weights. For the balanced lattice, P 38 
equivalent to Ёш, but this is not true for all other lattices. 


1Cochran presents the formulae for the si i i deem аге 
easily generalized to obtain formula (XI-100). S fay crop cesi; she velis 
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A fifth test has been proposed by Rao [255]. He makes the assumption 
that the weights are known and uses a chi-square test. The use of chi-square 
18 equivalent to using F with infinite degrees of freedom in the denominator. 
Since only estimates of the population variances cè and cg! are obtained, 
the use of chi-square tends to give too many significant results. If Rao's chi- 
square is divided by (k? — 1), it is equivalent to Е. and Fa for the balanced 
lattice. 

For practical purposes it is suggested that Fa 
because of its ease of application. Some adjustment to the degrees of freedom 
associated with the average effective error variance may be necessary. With 
large lattices such a correction is of small consequence and may be omitted. 


be used by the experimenter 


XI-8 Least Squares Estimates for a Double Lattice and 
Expectation of Mean Squares 


Assume that the ijth treatment yield in the gth replicate is repre 
by one or the other of the following linear equations: 


sented 


Xu = wt pr + В: + оту + ei 


Xoy = p + pa + Bas rud es 


(XI-101) 


where u = mean effect, 61; = incomplete block effect in the X arrangement, 
Boj = incomplete block effect in the Y arrangement, po = replicate effect in 
the gth replicate, 7; = treatment effect, ej; = а random effect, and i, j = m 
2, +++, k. Two equations are used instead of one in order to simplify the no- 
tation! 

FECTS WITH RECOVERY OF 


XI-8.1 LEAS 
: T SQUARES ESTIMATES OF EF 
E CTURE NOTES, 1945; 48] 


INTERBLOCK INFORMATION [COCHRAN, LE 


The sum of squares due to intrablock variations is equal to 


Hace Xd} [Ja — â — & — Bu — ful + [Xs — â — ё — B; — Du 
dr (X1-102) 


and that due to the block constants or interblock variations is equal to 


Re ух — ha kh = JA + 22068 А7 к — Ут). 
ү | (Х1-103) 


е lattice, a third equation may be used for 
i i in order to denote 
arrangement; if АВ 0! у ct is [n in or te 
Л гапветепі; i is con! ther f : 
note lar block and treatment com! he incomplete block. This 
lon may be used for a quadruple, ** 
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The deviations in R, and R, are weighted inversely to their true standard 


deviations, 4/1/e and 4/1/«'. This procedure results in minimum variance 
unbiased estimates. The weighted sum of squares is 


Ву + o'R; = В, (X1-104) 
where w = 1/02 and w’ = 1/(¢2 + ke). Following the procedure of earlier 
chapters the partial derivatives of В are obtained, the resulting partial 
derivatives are set equal to zero, and the resulting equations are solved to 
obtain the least squares estimates, д, ôs, B; f; and 4,5, of the true effects. 
Hs Po, Bri, В, and т;у, respectively. 

ОБ Т -w22 (Xis — A — fi — Bu — 8) + (Xay — A — & — Bu #0) } 
hs DXi. — ha — kh — 378) +E ka — ki — 1282 | 
i E j i 
= 0. (XI-105) 
If Dô = Уё, = У; = DIM = 0, then 
i j 


в = ap. + Xa) а. фа) =  (Х14106) 


3857 7202 UG —йЁй—Ё&—Йи— у) = 0. (XI102 
li 


Bu = Кл. = 28) zi. (XI-108) 
= = =w (Хы — P — fr — у tu) = 0. (XI-109) 
By = 106; = 3) fh. (XI-110) 
The sum of equations (X1-108) and (XI-110) is equal to 
Bu + bs = Хы. H Gy — Улу Y» — 2p. © (хш) 


oR 


ahi as — A р Bas — fy Xu — B— fr — Bry — f) 


20) 
oa p Ar- == Ер = Rp, — УФ, + Х.; ce kp hà kôz = Ут) 
= dats; — Qa(Xay + Xa) + 200, + Ё) + 4(o + ш) 


20' 
= (Хк + X; – У, =- У») = 0. (Хт-112) 
Rearrangement of formula (XI- 


А 112) and substituti ight-hand side 
of equation (XI-111) for the t Substitution of the right-han: 


antity 1; + Êz; result in the following: 
e 


20 + 2u/0 — e Фуу, +54) 
2 J 


= «(А + Хы) — (о i e ex dx. (Х1-113) 
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If the above equation is summed over the subscript i and is divided by k, 
we obtain 


Doy зат — PSA) 


= 106; + Ж) - wx, EX). KTA 


Therefore, 
wt tu + ар = Xay + Xai) — a ) x. + ВХ.) 


" 07, 
== БХ F C — © үз е yen (XI-115) 


Likewise, 


(Ci Й ow! oy (0—0) oT 
= D = уу + 202 = PES ae po SITES Хз... (XI-116) 


Adding the above two equations, we obtain 
CHAN (Ys; t Yn) = non + Xai.) + Pc + Хз.) 
T 7 


FE eas. + Х»..) — 40 


= w(i; + Gai.) + o (E. + 2.) — (o + e^) Gh. + ®..). 
Substituting the above result for (Er + XU) in equation (XI-113) and 
Н j 
multiplying the result by (w + w’) we obtain 


2®(ш® + о)? + 2w (o + e')8 — (e — wo!) lo(£i-; + ®н.) + wo! (Ēri. + 32.5) 


= @ o^) (&.. + %..)) 
= ow тыл + Хы) — (02 — ®)(@н. + 225). (X1-118) 


we obtain the following: 


— 2н. БА; – 2.;}. (Х1-119) 


(ХІ-117) 


After collecting and rearranging the terms, 
a 1 (o — 9) (=. 
1 +0 = Dec + Xa) + а 872; 
eatment mean is the one 


4 Xz), is adjusted for 
.), multiplied by a 


The above least squares estimate of the adjusted tr 

Used in example XI-1. The unadjusted mean, 20s E 

the correction for block means, à(Zsi- — йн. + 215 T 
Weighting factor equal to (ш — #’)/(@ + o°. 

If interblock information is ignored, the least squ 

Ху + Xu + En. — ii- + д; — 9). 

The block correction is not dampened down by the weighting factor as it is 


when interblock information is recovered [322]. 
f more than one set of a double lattice 1s used, the procedure follows 


through in the same manner as for one set. The observations in the X arrange- 


ares estimate is equal to 
(ХІ-120) 


aet c Аш рыб ылар АЖ Ал „Ж, Lr A ene 
set of a double lattice is 
Х..2 1 : 7 A r 
Бээ = z = Dref eu T Ўр + Ви + Bos + 2r + | 
+ єз)? — эри E o, + RB + ЕЎ Вз; + 2059 r; + LL Leis)” 


Т 2 Fej + Bog + 2ko + Bot — (ut + Kop + kop + 202 + od) 


= @—1)}вг+ i 7 + 2o}. (XI-121) 


In obtaining the above expectations, it is assumed that the effects are inde- 
pendent and additive, that E82 = Eg? = vg, and that the remaining 
variables squared have expectations џ?, c 2, о,2, and cd. 

The replicate sum of squares has the expectation: 


B [1yx.. P ^ ] = 2008 + 0) + 2ko + 267 + 202 — ELX.. 2/28] 
= сд js hog? + Ro 2, (X1-122) 
The expectation of the block (eliminating treatment effect) sum of squares 
is equal to 
Coe = Xu | Mis Ж) (А. ai] 
4 р ЫЙЫ атт ae 
= 2k — D? + keg/2), КЕ 
with 2(k — 1) degrees of freedom. 
The expectation of the total sum of squares is 
Х..2 
E| EEx, = =] 
- GR De? + (08 — Bog? + 208 руз + Rey (XI-124) 
with 2k? — ] degrees of freedom 


Source of variation df Expected value of ms 


o? + kog? + Кт? 


Treatment (ignoring block) к 1 as reef oa aah 
ASA H Т 

Block (eliminating treatment) 2(k — 1) 

Intrablock 


oê + kog?/2 
(к — 1): m „5556 


(lormula X1-98), but no use is made of the interblock information 1n this test. 
The expectation of the component (a) sum of squares in more than one set 
of a double lattice is equal to cê + kag? [see 302]. 


XI-9 Derivation of Standard Errors 
XI-9.1 THE DOUBLE LATTICE 


"The variance of the difference between two adjusted means from an ex- 
Periment designed as a double lattice may be developed in the following 
manner, Consider the adjusted means for treatments 00 and 01. Then, 


eee k—1 ++ 
Soo = [(A)o + (ја afk (AB)o + V (AB Jo MA == 


апа 
Xu 


k 2k 
Vw — 4ш) = ув, — В). + (АВ) — (АВ). + +++ + (AB*7), 
- (авы), b= 1 fol(Bn— Baltel- (B, (дв), — (AB) 
_ (2с) + («’)2(2k) (cer + kog’) 


d = 


[Ae + (Вг 4 (AB) +: + (ABT ns _ 


+... (ABE), — (AB) - (о + w’)? 
Ak — Do? 2 ak—-1)_2f_ 1 к= ү (Х1-125) 
ar E oc = ko +o) 2ko Elo Tow 2% 


Where c? = 1/ and o2 + kag? = 1/o' (see formula (XI-5)). 
The variance of a difference between a pair of adjusted means of treat- 
Ments not appearing together in an incomplete block, say 00 and 11, is 


(1 = n UN 


V(zw' — 21) = V 


(AB) — (АВ) + --: + (АВ), = (ABE) 
ле k 


Аво +0!) , 2k(k 2) _ | Г E 3 i 
о ol)? ОШАК 
(see formula (XI-6).) 


X1-9.2 THE CE-DIMENSIONAL TR 
TICE) E TIIREE-DIM 


There are three types of comparisons among pairs of varieties in the 


IPLE LATTICE (THE CUBIC LAT-. 
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ment are denoted by Хуу; and in the Y arrangement by X;;;, where f = 1, 2, 
+++, q = number of repetitions of the simple lattice. 

ХІ-8.2 EXPECTATION OF MEAN SQUARES 


The expectation of the treatment (ignoring block) sum of squares in one 
set of a double lattice is 


BLED xa = ES F lyzo, zl Deo + В: + Boy + 2ту + е; 


F е)? — pku + Ep, + ЕЁ? Ви + RB; + УУ; + УУК | 


= 2002 + Мо 2 + Hop? + 2Ме^ + Rod — (202 + Kos + hog? + 202 + 02) 


k 
ps 2 2 2 XI-121) 
(e — nie. tg ti. ( 
In obtaining the above expectations, it is assumed that the effects are inde- 
pendent and additive, that Eg,? = Ef»? = cg, and that the remaining 
variables squared have expectations џ?, c 2, ¢,2, and c2. 

The replicate sum of squares has the expectation: 


Е| 1x. 4. zx] = 2k? + о) + 2heg + 20, + 202 — E[X.. 2/2] 
g 
= с? + kop + kèo è. (Х1-122) 


The expectation of the block (eliminating treatment effect) sum of squares 
is equal to 


(Xs. — Xi)? (Х.; — А.) Thus zn 
zx 2k А 2 га 2k? 
= 2(k — (сз + hegi/2), 1123) 
with 2(k — 1) degrees of freedom. 
The expectation of the total sum of squares is 


|ууух,г Ж 2s] 

= (2E — lo? + (2k? — Rog? + 2(% — 1)o,? + 10,2, (Х1-120) 
with 2k? — 1 degrees of freedom. The intrabl 
tained by subtraction. The above results are 


ock error sum of squares is ob- 
summarized below. 


Source of variation df Expected value of ms 
деш 1 oe + Ков? + Кор? 


Treatment (ignoring block) 


Block (eliminat: 
Intrablock 


(k — 1): oe 
2 


k-1 sê + 


LE 2о;% 
i ЕГ ri T 20: 
ing treatment) 2(k — 1) й 
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In the above analysis, it will be observed that no appropriate error exists 
for comparing the treatment (ignoring block) mean square. The expectation 
of the treatment (eliminating block) mean square is c? + 20,2. Hence, the 
intrablock error mean square is appropriate for testing this mean square 
(formula X1-98), but no use is made of the interblock information in this test. 
The expectation of the component (а) sum of squares in more than one set 
of a double lattice is equal to c2 + ke? [see 302]. 


XI-9 Derivation of Standard Errors 
XI-9.1 THE DOUBLE LATTICE 


The variance of the difference between two adjusted means from an ex- 
periment designed as a double lattice may be developed in the following 
manner. Consider the adjusted means for treatments 00 and 01. Then, 


eee к—! +++ 
Zw = LA Eae + (AB)o + = (АВ) _ = 


апа 
ви = И» + (Вы 4. (AB) + + (АВ ъз e 
Уба — à) = EV (Bo — Buca (АВ) — (AB + + (АВ 
T BM), be Lye» = (Bul 2 иб» = (Bs, (AB, (АВ), 
£n [ABESSE вы) = өф) + ЭЖЕЕ + Ков) 


Where ¢2 = 1/w and oè + Ков = L/w! (see formula (Х1-5)). 
The variance of a difference between а pair of adjusted means of treat- 
ments not appearing together in an incomplete block, say 00 and 11, is 


o B Yjwt*: 
vie Dant КОО 


V (zw' = #1) = 


= ль... + (AB) = (АВ) 
КАВ)» — (АВ + a Yo n} 


дво 4507) , 2k —2) = 2 23.8988. n (X1-126) 
7 Blo А o) Ikw klo + w 2 


(see formula (XI-6).) 


Е 


Toa THE THREE-DIMENSIONAL TRIPLE LATTICE (THE CUBIC LAT-. 
CE) 


There are three types of comparisons among pairs of varieties in the 
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three-dimensional triple lattice. These are exemplified by the following 
comparisons among adjusted treatment means: 


ox! — Фо, Zooo — Zon’, and Ty» — п. 


Each of the comparisons is subject to different variances. Thus, 


со гыре vie = Ale + (Bhe — (Bs + (Co = (dena 
a САВ), — (АВ), + --- + (BCE) — (BC), uua 
Ё? 
4 (ABO) = (АВО), +... + (ABC), — ers) 
NENNEN EM т E LEE ARI 


2(1) чы (2)(2)(k — 1) "n 2(k — 1)? 
(о + 207) ' Row) ЗЕ 


=2) 1 2(k — 1) Gn 
E ae egi БЕЛОК БС (ХІ-127) 


(see formula (X1-69)). Also, 
VE! — Zon!) = у) = (4), + (By = Bi + Oe = (Oera 


+ UAB)o— (АВ), + +++ + (ВО), — (ВО), +... + (ВС), = (BC) gourd 
p 


n (ABC) — (ABO), + ... + (АВС), — (АВС), 
ЕКИ ur oeste uan 


= 22 __, 20k—4) 26-1) 2) 
Bo 207) Elu 4 o) t [m 


= 2 3k—4 | (k—1)(k—2 
т ae Faw + Rpg ta (XI-128) 


(see formula (XI-70).) 
Ya! у= vien. = (4), + (В), - (В), + (De = (Сы 
+ КАВ) = (АВ), +... + (ВС), (BG, 


+ ABCs — (АВО) +... + (ABC — (ABC), 
TL e y S 


eC) 2(3)(k — 2) | 2(k зь 3 
PUHAN i BOs Fal) t е Бш 


220) WW) 3(k —2) | kt — 3p 
els + 27 CU», Ya + = 8 
(see formula (XI-71).) 


(X1-129) 


CHAPTER XII 


Lattice Designs with More Than One Restriction on 
the Allocation of Treatments in the Complete Block 


XII-1 Introduction 


The experimental units within a complete block are often subject to more 
opriate experi- 


нап. one source of variation. Recognizing the need for аррг‹ 
ental designs for such conditions, Yates [323, 324, 328] developed a group 
wo- or more-way elimination of hetero- 


of incomplete block designs with t 
geneity among the experimental units within the complete block. The most 


Populer and widely known of these designs are the semi-balanced and balanced 

mro squares [328]. A number of other workers [52, 104, 106, 171, 175, 178] 
ave added to the group of designs proposed by Yates. 

be апше of more than опе type of variation within a complete block may 

E ound in most fields of experimentation. For example, in field husbandry 

үү кы, soil gradients may extend in two directions. The day-to-day 
nd worker-to-worker variations in laboratory experiments represent two 


ONE of variation which may be controlled with the lattice square design. 
atch-to-batch and order-of-preparation or evaluation of material represent 
] experiments the litters may 


Quir examples of two-way variation. In animal ts the pa 
ОҢУ one source of variation with the time period on each individual within 
Sun үт forming a second source of variation; also, in certain types of experi- 
pa а third restriction in the experimental design may be used to control 
а ец due to size of animal. Under such circumstances, the whole plots 
RES be the litters, the rows of split plots in a whole plot might be the size 
of ы and the columns of split plots in a whole plot might be the time 
k observation. The two-way control of variation within a whole plot would 
orrespond to the ordinary rows and columns in a lattice square design. 
When recovering interrow, intercolumn, interwhole plot, etc. information, 
care should be taken to have sufficient degrees of freedom associated with each 
Variance. It is suggested that at least 10 to 14 degrees of freedom, and prefer- 
ae 16 to 20 degrees of freedom, be associated with each variance. If fewer 
ti an 10 to 14 degrees of freedom are available, then only intrablock informa- 
ion should be recovered, provided that there are St 


fficient degrees of freedom 
associated with the intrablock variance. In some instances the mean square 
377 
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due to rows, columns, or blocks is less than the intrablock error mean square, 
resulting in less information from residual or intrablock error than from the 
other source. The recommendation here is to set the mean square which is 
less than the intrablock error, E., equal to it; in this case, the adjustment to 
a treatment mean, due to this source, is equal to zero. For example, suppose 
that the row (eliminating treatment) mean square, £,, is less than Е, ina 
lattice square. The recommendation is to set E, equal to Е,; this results in 
zero adjustments for row effects. 


XII-2 Lattice Square Designs 


The two-dimensional two-restrictional lattice (the quasi-latin square or 
the lattice square) is an incomplete block design of & treatments in which 
the treatments are grouped into incomplete blocks in two ways, similar to 
the grouping by rows and columns in a latin square. The comparison of a 
two-restrictional lattice with a one-restrictional lattice may be likened to the 
comparison of the latin square design with the randomized complete block 
design. The relative efficiencies are also comparable, with the two-restric- 
tional lattice generally being more efficient than the one-restrictional lattice 
[50, 73, 168]. 

The efficiency of the lattice square design relative to the randomized 
complete block design has been obtained for a relatively large number of 
experiments. Cochran [50] found that for corn in 2 X 10 hill plots the increase 
in accuracy for the lattice square was about one replicate in six for 25 treat- 
ments, one replicate in five for 49 or 81 treatments, and one replicate in three 
for 121 treatments. Bliss and Dearborn [25] and Ma and Harrington [201] 
obtained similar results on other crops, but Cox [73] found somewhat greater 
advantages for the lattice square over all types of experiments. Cochran [50] 
found that a lattice square in three replicates resulted in a 15 per cent gain 
in efficiency over the triple lattice design; the data from three uniformity 
trials were used for this comparison. Johnson and Murphy [168], using oat 
uniformity trial data, found that the lattice square design was more efficient 
than the triple lattice design and that the amount of variation controlled БУ 
rows and columns in the lattice square design was roughly proportional to 
the width and length of the replicate; they point out that replicate shape 
affects apparent precision of the lattice design. Zuber [340], using corn uni- 
formity trial data, obtained similar results. h 

The layout of a lattice square design in an experiment should be 800 
ee D replicate represents what would have been used D d. 
elits Gian "x block design if one is to compute relative efficiency. vn A 
biased R pP randomized complete block design by intent resul a 
an attempt should "i efficiency of the lattice square. As with the latin Ш a 
RUNS sud col е made to obtain approximately equal informatio an 

umns. (If the whole of the variation within a complete block © 
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р сү with either rows or columns, a one-restrictional design should 
Eu 4 = me of the experimental unit can be altered in some experi- 
ra effect this, although Cochran [50] found that plots of 4 X 5 hills 
M Mc no more accurate than 2 X 10 hills; Bliss and Dearborn [25], 
en -foot single row, came to a similar conclusion. In general, it is sug- 
М E the variation in both directions be equalized and that the plot 
Er : е т justed accordingly. The same recommendation holds for laboratory 
hee louse experiments wherever applicable. In the absence of knowledge 
нв уагїайоп in field experiments, both the complete block and the 
mith SE unit should be square, provided that these shapes are consonant 
бут ural operations. In field and laboratory experimentation the rows 
E Wee order within the incomplete blocks which are the columns, 
E columns may be laid end to end, provided that the experimental con- 
iUm (e.g., contours or order of performing an observation) warrant such a 
The randomization procedure for the lattice square d 


o the treatments at random unless otherwise 


esign is as follows: 


(i) Assign the entry numbers t 
S indicated; 
(i) assign the arrangements to the replica 
(iii) assign the levels of the pseudo-effects con 
, 8b random within each replicate; 
(iv) assign the levels of the pseudo-effect con 
random within each replicate. 


tes at random; 
founded with columns to the columns 


founded with rows to the rows at 


Roe procedure of randomization preserves the arrangements ог the 
simil S of confounding pseudo-effects in each replicate. This procedure is 
ar to that for the latin square design in that the basic arrangement is 
preserved, 
ET and Federer [42] have tabulated a 
iss and have prepared twelve randomiz 
and 13 ү for k= 5, 7, 8, and 9, and six r 
the ER f а semi-balanced lattice square 18 
used. Е, (k + 1)/2) arrangements given by 
do (к or other lattice square designs а num 
+ 1)/2 or k + 1 may be taken from their listing. 


arrangements for balanced lattice 
ations of each of the k + 1 ar- 
andomizations each for k= 
desired, the first (k + 1)/2 (or 
Clem and Federer [42] may be 
ber of arrangements not equal 


Хп. 
2.1 SEMI-BALANCED LATTICE SQUARE 


Cs semi-balanced lattice square for А treatments as proposed by Yates 
<9] consists of (k + 1)/2 = г arrangements, with every treatment compared 


се other treatment in a row or in a column. Each of the pseudo-effects 
b we either with the rows or with the columns in one of the replicates. 
OP EU cdd arrangements are possible for k — an odd prime number or power 

prime number; i.e., Ё = 395, 1,9: 1; 13, etc. For example, in а 


25 
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5 X 5 lattice square with twenty-five treatments a system of confounding for 
the (5 + 1)/2 = 3 arrangements is 


Pseudo-effect Arrangement 
confounded with: 


Rows 


Columns 


Also, any other permutation of the six pseudo-effects may be used. Other 
methods of construction may be used; e.g., use of orthogonal latin squares 
. [305, 323], exhaustive enumeration, etc. 
'The general form of the analysis of variance for q sets of the basic set of 
(k + 1)/2 arrangements is 


178: ӨЛК Seer o: 0 5 ———s 


Mean square 


Source of variation df 
Observed Expected value 

oo ОЕ ВПАВ үз 2А IURE. сире" т е 

Replicate (г— 1) =q(k+1)/2-1 — == 

"Treatment (ignoring row and k?-] = = 
column) 

Row (eliminating treatment) q(k? — 1)/2 Е, od + (r — Dkeot/r 
Component (a) (а — 1)(k? — 1)/2 — с 4 Кор 1) 
Component (b) (k? — 1)/2 — e 4- (К — iep ar 

Column (eliminating treat- 
ment) a(k? — 1/2 E oë + (r — Dkoy7/t 
Component (a) (а — 1)(k? — 1)/2 — с + Коу? 1) 
Component (b) (k? — 1)/2 — e -4k(k— Loy?/(k de 

Intrablock (k? — D(qk—-q—2)/2 E ed 

Total 


КЗК + 1)/2 — 1 = = 


The component (a) sum of squares for rows represents the interaction 
sum of squares of the levels of pseudo-effects and the replicates in which t 
effects are confounded with row differences. Likewise, the component (a 
sum of squares for columns is the interaction sum of squares of the levels. 
the pseudo-effects and the replicates in which the effect is confounded vit 
column differences. (These two sums of squares are of the same nature 28 
component (a) sum of squares obtained in one-restrictional lattices.) 

The component(b) sum of squares (similar to the component (b) 8% ы 
squares in tho one-restrictional lattice) is composed of the sums of squat F 
of the contrasts of the levels of the pseudo-effects in the replicates in whic | 
the effects are unconfounded with the same levels in the replicates in s 
the effects are confounded. In semi-balanced lattice squares, pseudo- i 


m of 
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confounded with rows are unconfounded with columns, and vice versa. Such 
a scheme of confounding obviates the necessity of removing column effects 
from row effects, and vice versa, since these effects are independent. 
A Effort should be made to obtain equal information on all effects. There- 
ore, rather than repeating the basic set of (k + 1)/2 arrangements, it is 
recommended that additional arrangements be used. For example, instead of 
using two sets of (k + 1)/2 arrangements of a semi-balanced lattice square, 
? is more desirable from the standpoint of all-around efficiency to use a 
alanced lattice square with k + 1 arrangements. 
M Homeyer el al. [160] give detailed instructions for calculating the analysis 
variance table, for adjusting the treatment means, and for obtaining the 
Standard errors. They present procedures for both punched card and cal- 


culating machines. 


Y ү апше Хи-1. А E? = 3? semi-balanced lattice square example in (k + 1/222 
nr icates is given in table XII-1. The A effect is confounded with row differences and 
e B effect with column differences in replicate I and АВ with rows and AB? with 


columns in replicate II. 
the data are systematically rearrange! 
ibt ysis of variance are computed. The variety (ti 
Я E of the table, and from the totals of rows an! 
t А » AB, and АВ? are obtained. The totals for levels of the effects are placed opposite 
r COMES ies row or column total in the top part of table XII-2. The (А). — 
REP s — (В), (АВ). — r(AB)so, and (АВ?).ь — r(4B:s quantities are com- 
BAR in the same manner as for the double and triple lattices. The second (АВ). — 
ка Ju value is equal to 24 — 2(9) = 6. The sum of the (4).; — 2(4): = —8 must 
m al the sum of the (B).; — 2(B; = —8 in replicate I and similarly in replicate п. 
к, e of the (A).; — 2(A)1; and the (4B).; — 2(AB)s; must equal zero in the 3? 
Al i alanced lattice square; i.e., the row contrasts must equal zero. Also, the sum of 
т. column contrasts must equal zero; thus, 8 ++ (—8) = 0. » З ч 
in HO last row and column of the tables headed “Replicate I” and Replicate Ir 
m e XII-2 are obtained after computing the analysis of variance а a yeighting 
ors. (In this case, no row and column corrections are obtajad D 


zero degrees of freedom for the intrablock error sum of squay? ANS 
» 


d in table XII-2, and the totals for the 
reatment) totals are given in the 
d columns the levels of the effects 


out for a semi 


TAB 
LE Хїї-1. Yields and experimental lay 5 
(Variety numbẸrş 


3? varieties in two replicates 


Qo) 5 (01) 
(œ) » (20) 2 
г) 2 (0) 5 


124(А)уу 
1i=(A)j9 
1%=(А)ур 


2 Qo 3 (11) 7 
6 (20) 5 (20) 3 
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i lysis 
TABLE XII-2. Systematic arrangement of yields and totals required for the analys 
of variance 


(0) 8 (0) 3 ( 
(20) 3 Q) т ( 
(20) 5 (21) 5 (€ 


(AB*) 5), 
(дв) ү 


(AB?) | - 2(4в°), 


D 
z 


Variety totals 


(00) 14 (о) 5 (c) 7 
(10) 6 (11) 10 (22) 5 
(20) 7 (21) 5 (22) 15 


Source of variation 


Replicate 


Variety (ignoring row ana column) 
Error (r. c. b.) 


Column (elininating variety) 
Row (eliminating variety) 
Intrablock 


Correction for mean 


The sum of squares of the quantities, 


DUA = rU) УА): — (А), Д) + LIAB). — (АВ) 
4 


k(k? — 1)/4 Ek — J) k(k? — 1)/4 
(Z:B)., — (ABI: oar? 
KR? — 1)/4 


= (22? + (=3)? + (=3)? | 3.22, is equal t° the 
Aue d 22, 


(=8)2 ү 224 62 + 02 ге 
18 6 18 
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row (eliminating variety) sum of squares with r(k — 1) — = En = 1 = EM 
2 


— 4 degrees of freedom. 
The column (eliminating variety) sum of squares is equal to 


УКВ) у – (В? — (2:0). = Bull? | У[(АВ?).„ — г(АВ?)зһ]? 
R 


k(k? — 1)/4 EE — 1)/4 —1)4 
= {2[(АВ?°).„ = r(AB?)]]? 
NEU EGR— 1)/4 (XII-2) 
_ (5)? + (0+3 _ (-8)? | H+ H+ 0 8 
6 =a z — 7g = 1022 


ру 1) = 4 degrees of freedom. 
he intrablock error sum of squares with (&* — 1)(k — 3)/2 — 0 degrees of free- 


us is obtained by subtracting the row (eliminating variety) and column (eliminating 
13 wed sums of squares from the randomized complete block error sum of squares, 
44 — 3,22 — 10.22 = 0.00; in this example the residual sum of squares is equal to 


сее ай variation and all degrees of freedom are allotted. 
he weighting factor for the column corrections is equal to! 
| Eech _ 2Е.— Е.) _ 2(w — we) 
п= Ет 108; RR-DE. КК – 0ш + 2w] С) 


а NT 
nd the weighting factor for the row corrections is equal to! 


E.—E,  XE.—E)..  20— w) 
e kr— DE, kk-— DE. kik- w+ 2w] 57 


where r = (k + 1)/2, 
ш = E, (XII-5) 
ш, = (r — 1)/(rE, — Е.), coe 
(XII-7) 


" = 
nd w. = (r — D/GE, — EJ). 

tors are not computed, since they are 
-balanced lattice square. In the event 
computed as 


— 2(AB)so], ete. (XII-8) 


For thi А 

а в particular example the weighting fac 

eA tainable for k less than five for the semi 
is five or greater, the row corrections are 


ca! = МКА): — 2(А)ь], e! = AL(AB)-0 
and the column corrections as 


су = u[(B). — 2(В)1 си QUE) 


= u[(AB?)., — 2(АВ?) зь), ete. 
T : 
es corresponding row and column. corrections from each replicate are added to the 
кыр total to obtain the adjusted totals. Since every pseudo-main effect or inter- 
added is confounded either with a row or a column, there are k + 1 adjustments 
ed to each variety total. kA +2) 
Я е average effective error variance is е.|1 Ветонит £ | and the efficiency 
of t Р i 
is ee semi-balanced lattice square relative to the randomized complete block design 
е ratio of the two average effective error variances, 
1 
set 110 > 1, divide extreme right-hand side of formulae ООЗ 
a(k + 1)/2 in formulae (XII-6) and (XII-7). 


) and (XII-4) by q, and 
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randomized block error X 100 (XII-10) 
kA +p) 
zn] 
The average standard error of a difference of any two adjusted variety means 1s 


3E, МАЕ EX КЕТ} our 
Expte 3 тї = кы к=к) 


aug ] s is 
The standard error of a mean difference for two varieties appearing together in a row 1 


2E, (XII-12) 
IG — k+ Du], 
PETIT- DA (k+ Dy) 
and for two varieties appearing together in one of the columns is 
2E, (XII-13) 
k — 1p}. 
ka 2 t e+ DA + - Da 


XII-2.0 BALANCED LATTICE SQUARE 

The balanced lattice square [328] is made up of k + 1 arrangements (or 
sets of k + 1 arrangements) in which every pseudo-effect is confounded once 
with rows and once with columns in one of the k + 1 arrangements. This 
also means that any pair of treatments occurs together once in а row an 
once in a column. Balanced lattice squares are available for all prime nimh 
or powers of prime numbers (k = 2, 3, 4, 5, 7,8,9, 11, 13, etc. ; [see 42]). T В 
scheme of confounding of the pseudo-effects for a 5 X 5 lattice square 1$ 


Pseudo-effect 


Arrangement 
confounded with 


Rows 


Columns 


x ee 
or any other permutation of the k + 1 = 6 pseudo-effects in the first thr 
and in the last three arrangements. 

"The general form of the anal 


3 H ‘o set 
ysis of variance for q sets of the basic 
of k + 1 arrangements is х 


и Меап square 
Source of variation df Observed — Expected value. 
Replicate q(k-4-1)—1 — Fw 
Treatment (ignoring row and 
column) к 1 Р Е) 
Row (eliminating treatment; 
ignoring column) q(k? — 1) = 3 
Component (а) (9 — 1) (к — 1) —_ = 
Component (b) (k? — 1) as = 
Column (eliminating treatment 2/4 
and row) atk? — 1) E, od + (gk Dy 
Component (a) (9 — 1)(k? — 1) T сё + Кот з 
Component (b) (k? —1) de oe 4k Ве 
Intrablock Œ — 1)(qk —q— 1) Е, сг? 
а наанаа) н. зои 2226 
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Total qk*k +1) -1 m cs 
Column (eliminating treatment; 
ignoring row) q(k* — 1) = = 
Component (a) (а — D(k* — D L3 A 
Component (b) (k? — 1) = ҮЛ 


Row (eliminating treatment 


and column) q(k? — 1) E, оё + (qk — De,*/q 
Component (a) (а — 1)(k? — 1) = oe + Коу? 
Component (b) (k? — 1) = og + (k — оз 


The component (a) and (b) sums of squares are of the same form as for 
one-restrictional lattices and for the semi-balanced lattice squares. The sums 
of squares for row (eliminating both treatment and column effects) and for 
column (eliminating both row and treatment effects) add a new feature to 
the analysis, The method is illustrated below with an example. Also, Homeyer 
et al. [160] describe the numerical procedure on both punched card and cal- 
culating machines; their symbolism is retained in the present example. 


Example XII-2. An illustrative example of a k? = 3? balanced lattice square in 
k+l=4 replicates with the row, column, and replicate totals is given in table 
XII-4. Effect A is confounded with rows in replicate I and columns in replicate IIT, 
effect B with columns in replicate I and rows in replicate III, effect AB with rows in 
replicate II and columns in replicate IV, and effect АВ? with columns in replicate II 
and rows in replicate IV. 

Table XII-4 contains the totals necessary for comp 1 
The column headed variety total — V contains the totals of the varieties from tlie 
four replicates. The total yield for variety 01 is34-24-24-3— 10. Column 3 with 
the heading SR represents the sum of the row totals in which the variety appeared. 
For example, the value SR for variety 02 is obtained as 14 + 9 + 13 +9 = 45. The 
Sum of the SR values should equal k times the grand total, kG = 3(146) = 438. The 
column headed SC contains the total of column sums from the individual replicates 
for а variety, For variety 01 the SC value is 13 + 7 + 13 + 13 = 46. The sum of 
the SC's should equal k times the grand total, 3(146) — 438. Column 5 is the бего 
of columns 3 and 4; thus, for variety 01, SR — SC = 44—46— —2— D. The E values 
in colum 6 are obistmed'as И ЕСК L DSR LA аиа 
+146 — (3 + 147 = —6. The J values are the sum of the corresponding D and L 
Values; for variety 21, J= D+ L/ = —23412-2K- J+ (k— 0р; for vari- 
ety 20, K= уф (ie DD =4+(3— 1)(0) = 4. М = р + К; for variety 00, 

"= D-- К =1 +0 = 1. The sum of any of the columns 5 to 9 should equal zero 
exactly. As a farther check, M’ = kV + G — (Е + 1050; for variety 01, M' = 3(10) 
Ee (3 + 1)46 = —8, which checks with the value obtained before. 

The replicate, variety (ignoring row and column), and the total sums of squares 
(table XII-5) Ате СШ Ө, in the usual manner. The various row and column sums of 
Squares are computed as the sums of squares of the L’, J, K, and М” values. The row 
(eliminating variety effect but ignoring column effect) sum of squares with 
(+ 1)(k — 1) = k? — 1 = 8 degrees of freedom is 


uting the analysis of variance. 
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TABLE XII-4. Yields, experimental layout, and totals for a balanced lattice 
square design with 3? varieties in four replicates (Variety numbers 
in parentheses) 


Replicate 1T 


Е wec- |+ 10| + (к - 1)рјр+к 
(к + 1)58 = 11| =J = К = Mt 
-2 1 


о 
-6 
n 


1 
wan Бом 


о|уюь F 


Sun 146 =G 438 438 


DL)? _ (—3)? + 02 + --- + 12 + (—15)? 
ean nai (—19* 4148. (КП20) 


The row (eliminating variety and column effect) sum of squares with r(k — DS 
8 degrees of freedom is 


WI? (2): +. (22)*4- ee 4 (—. e 
Е(Е — 1) ‘ а DECOT сезу. (K-19) 


The column (eliminating variet; i i ith 
y effect and ignori uares W! 
r(k — 1) — 8 degrees of freedom is Y NER 


DOE Пр (вун... yee 
EI) Е а шуыш 5 KO) 


The column (eliminating both vari uh 
ety and ES 
(k + 1)(Е — 1) = 8 degrees of freedom is ee c PT 


E(k —1) 


аи оа аара са 
m ee T1407... (XO. 


The sum of the row (eliminating variety and ignoring column effects) and the column 
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TABLE XII-5. Analysis of variance 


Source of variation 


Replicate 
Variety (ign. row and colum) 
Error (r. c. b.) 
Row (1gn. column; elim. variety) 
Column (elim. row anà variety) 


Colum (elim. variety; ign. row) 
Row (elim. variety and column) 


Intrablock 


(eliminating variety and row effects) sums of squares should equal the sum of the 
column (eliminating variety and ignoring row effects) and the row (eliminating vari- 
ety and column effects) sums of squares within rounding errors; thus: 4.148 + 11.407 
= 15.555, and 9.926 + 5.630 = 15.556. Either of the above sums is subtracted from 
the randomized complete block error sum of squares to obtain the intrablock error 
Sum of squares, 19.11 — 15.556 = 3554, with А2 — 1) — 208 — D = E — k — 
2k? + 2 = 8 degrees of freedom. 
The row weighting factor is 


л (Er — EQQE,— E) _ (0-704 = 0444)[3(1.426) — 0.444] _ 9 9564, (KII-18 
-IEEE -EA (3— 1000.704) (1.426) — (0.444)"] 0.0564, (XTI-18) 


and the column weighting factor is 

_ (E. — E.)(kE, — Е) _ (0.426 = 0.444)[3(0.704) — 0.444) _ 1 

и = (Ec — EQ(RE, — E.) _ 050 — OD (1426) — 0.40] 0.0927. (XII-19) 

(k--Q)REE.—E2S (3—1) [9(0.704) (1.426) — (0.444)?] 

. Тһе adjustment for a variety total is pM! + MA 

d CP) “+ (.0564)(—3) = — 0.0765, and the adjus 
= 7.231, 

The standard error of a mean difference betw 


| PO ч 200.444) = 0.567. 1-20 
i Tl +A +] = 20A + 3(0.1490)] 0.567. (XII-20) 


There is а single standard error of a mean difference, since all comparisons are of 


equal accuracy. 
The average effective error variance i8 


for variety 00 the adjustment 
ted mean is [29 + (—.0765)]/4 


een any two variety means is 


Ел + kA + 01 = 04440 + 3(0.1491)] = 0.643, (ХП-2) 


us the efficiency of the balanced lattice square relative to 
lock design is 
0.796 J 100 = 124 рег cent.(X11-22) 


randomized complete block error = 
x 100 = 
ЕТА 9 oe 


the randomized complete 
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XII-2.3 OTHER LATTICE SQUARES 


- It is possible to set up lattice rectangle designs for k(k — 1) treatments, 
but the usefulness of such designs in experimental work is limited. As an 
example, suppose that fifteen workers of three skills are available, that these 
workers are capable of performing only four treatments per period, that twenty 
treatments are to be compared, and that the fourteen batches of material 
differ with regard to the characteristic of interest. A possible design is the 
following: 


Skill I Skill II Skill III 


Batch Worker No. 


2| 3| 4 

7| 8| 9|10 
11 | 12 | 13 | 14 | 15 
16 | 17 | 18 | 19 | 20 


Designs of the above type are constructed by deleting the last k treatments 
from а set of treatments numbered from one to k?. In such designs, each pair 
of treatments is compared either in a row" (batch) or in a “column” (worker). 
Also, two or more of the arrangements may be used. If worker and batch 
variation estimates are of no interest and if a fifteenth batch is available, an 
alternative design is the triple rectangular lattice design with the worker and 
batch differences being completely confounded; ie., each worker completes 
one batch. 

The partially balanced lattice squares consist of lattice squares with 
2, 3, +++, (k — 1)/2 arrangements of the k X k lattice square [52, 178]. For 
example, if three arrangements of the 7 X 7 lattice square are desired, the 
following scheme of confounding may be used: 


Pseudo-effect 


See. |) c _ __ 0. 
1 2 3 
De MES LL 
Rows A AB AD: 
Columns B AB? ABA 


If a fourth arrangement, confounding the AB5 and АВ" effects with rows and 
columns, respectively, is used, a semi-balanced lattice square results. If a 
fifth arrangement (e.g., confounding A with columns and B with rows) ÍS 
used, the design becomes unbalanced. Designs of both types are illustrate 
in sections X1I-2.3.1 and XII-2.3.2. 

Another type of confounding suitable for k = 6, 10, and 12 has been pro- 


posed [106, 178]. The three pseudo-effects A, B, C from a set of k? treatments 


are each confounded once with row and once with column differences. Also: 
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each effect is unconfounded in one of the three arrangements (table XII-8). 
For k = 12 it is possible to set up four arrangements such that each of four 
pseudo-effects is confounded once with rows and once with columns and each 
is unconfounded in two of the arrangements. 

A fourth type of lattice square has been proposed by Kempthorne [175, 
sec. 24.4] for p^ treatments in a lattice rectangle of p* columns and p’ rows 
(s + г = n). For example, thirty-two treatments could be arranged in eight 
columns and four rows. Kempthorne proposes а scheme of confounding cf 
the pseudo-effects in three arrangements such that intrablock information 
is obtained on all effects. 

XII-2.3.1 Analysis for 2, 3, с, (k — 1)/2 arrangements. If one 
or more replicates of a semi-balanced lattice square is lost, if it is necessary 
to use fewer replicates than (k + 1)/2 because of lack of material or facilities, 
ог if a greater degree of precision is not required, the result is a lattice square 
design with less than (k + 1)/2 arrangements (52, 175, 178]. A k X k lattice 
Square in two replicates is selected to illustrate the method of analysis for 
Partially balanced lattice squares. Suppose that pseudo-effects A and AB 
are confounded with row differences and Band AB? with column differences 


in the two arrangements; thus: 


Pseudo-effect Replicate 
confounded with 1 2 2 
МЕРС ЕРЕН ЕНЕ Е ———— 
Rows А AB 


НН Bene 


If X,.., X,.., Xij, and X... represent the replicate 1, replicate 2, treatment, m 
&rand totals, respectively, and if the totals for levels of effects are represente 
аз before, the analysis of variance and the sums of squares are as ета 
table XII-6. The sums of squares for row (eliminating treatment) and 3 
column (eliminating treatment) are of the same form as the component (b) 
Sum of squares in the double lattice design. 

d from the following formulae: 


Th ; i 
e weights are estimate RU. xdi 
ш, = 1/@Е, — Е). (XII-24) 
(XII-25) 


v, = 1/QE. — Ё). 
The row adjustment factor for the differences (А) — 2(A) and (AB). 
~ 2 А М D 
(АВ), is equal to шш. (X11-26) 
k(w + ш) 
The column adjustment factor for the differences (B) -u — 2(B). and (АВ?) .„ 
Бер) z 
(AB?);, is equal to wa ш, (XII-27) 
(ш + ш) 


TABLE XII-6. Analysis of variance for а k X k lattice square in 2 replicates 


Source of variation ms 
Observed Expected value 


Replicate Z 2 
Treatment am in) A bos 
(ignoring row and column effects) 5 E 
Remainder - estimate of By subtraction 
mrendomized complete block error 


x3 JEA) y -2(4),,]? + [(4), - 202), ]? 
uo 2k 


Row (eliminating treatment) 


ki JE) a - 208), ‚72 + (в) -2(27),]? 
оѓо i лр е г 


Column (eliminating treatment) 
ex? 


By subtraction or by direct computation as the 
(k = 3)(k = 3) interaction of levels of effects AB), Tm АБЕУ 


Intrablock error 
with replicates 
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Four adjustments, two each for rows and columns, are added to the unad- 
justed total to obtain the adjusted treatment total. 

The average effective error variance is 

2 2 2 ka м 
кыты крш? 2ш S’ IL 
and the average standard error of a difference between two adjusted means is 
the square root of formula (XII-28). The efficiency of this lattice square 
design is 100E,' divided by the average effective error variance (formula 
(XII-28)). 

XII-2.3.2 Analysis for more than (k+1)/2 arrangements. If 
more than (k + 1)/2 but fewer than (Е + 1) arrangements are used, the 
analysis becomes slightly more difficult. To illustrate, suppose that four 
replicates of a 5 X 5 lattice square are used with the following system of 
confounding of the pseudo-effects in the four replicates: 


Pseudo-effect 
confounded with 


Rows 
Columns 


mi-balanced lattice square. 


The first three repli : 
plicates form a se tice sq! 
If the system described in section XII-2.3.1 is utilized, the analysis of 


variance is of the form presented in table XILT. The formulae o m уз BE 

sums of squares are given in the table, along with the expectation о 

mean squares. i А h 
The weights are estimated from the mean squares 1n table XII-7 by the 

following formulae: UIS (XII-29) 

w, = 17/(24E, — TE), БЫУ 


апа 
We = 17/(24Е. — ТЕ). (XII-31) 


The various weighting factors are А7 (XIL32) 
500; + we + 2ш) 
ш — Wr (XII-33) 
Б(ш„ + we + 2ш) 
ү ш — Ur (XII-34) 
500; + 3w)’ 
— 0 — We (XII-35) 


5(we + 3и) 


coe 


TABLE XII-7. Analysis of variance for 4 replicates of a 5 X 5 lattice square 


Source of variation | df 


Replicate 3 


з =, A 


Treatment (ign. 2% 


г г 
row and column) Doni ET -x ., /100 


Remainder = est. 72 


of г. c. b. error Py auberactson 


z (to "me (A) yu JEB) wu (AB Joy J% (AB?) Кы (әз?) ]°+ Ке) Aa (в Ум 7} 


Row (elim. trt.; 16 
ign. column) 


Соо) YR CH, PG un, 17] /300 


= (t ww (А ha (А) Н (в) 247508) (В), Y ut {raa ut (aB? ы гав) асма") y) 


006.095 2, у(х, Xz, Ry, Y)/aso-[(x ә rx DS. 51/300 


Colum (elim. o2+0502/24 


trt. ana row) 


By subtraction or by direct computation of levels of effects with replicates in which 
the effects are unconfounded 


Intrablock 


E ( (A) BO) My PHL) 3B), yy FP} +22 (сов) ome (аз?) ыз?) 


сё+85са/2ъ 


Row (elim. trt. 
and column 


= (x, x. әх) эх, -mx, )°] = [x -ю„. PHO aS 05] 


z (в) Me), PHLOB) 027), PALAB) -(2*), PCa) „АУ F } (60 
= (Oe, nba, YP x, YQ a, Aes е 17/300 


Column (elim. 
trt; ign. row) 


§ XII-2.3.3] Incomplete Lattice Square 393 


Formula (XII-33) is the weighting factor for the differences (A).u — 4(A)1. 

em (B)., — 4(B),.; formula (XII-32) is the weighting factor for the differences 

eed WC e апа (B)., — 4(B)1u; formula (XII-34) is the weighting factor 

ge erences (AB) шы = 4(AB)2u and (AB?) ы = 4(AB3),,; and formula 

: is the weighting factor for the differences (AB?) ., — 4(AB?)», and 

(AB ) — 4(AB4)3,. After multiplying each of the differences by the appro- 

priate weighting factor, the four row adjustments and the four column adjust- 

ments corresponding to any treatment are added to the unadjusted treatment 
total to obtain the adjusted total. 

The average effective error variance of the above design is 

4j - .1 41 ayes 

3lu, wm. + dwt w, + 3u 

ean difference for any pair of adjusted 


1 
tata ms 


The average standard error of the m 


treatment means is 
[2 1 1 1 
2{_ БИЕ ЕЕЕ 5 И 1-37 
TN EE КИИ! es ) 


The average efficiency of this design relative to the randomized complete 
block is 100E,' divided by the value of formula (XII-36); E.’ is obtained from 
table XII-7. 

XII-23.3 Lattice square designs for k = 6, 10, and 12 in three 
replicates. Lattice square designs discussed in the preceding two sections 
are for values of k equal to a prime number or power of a prime number. 
Since the scheme of confounding for prime numbers is not appropriate for 
k = 6 and 10, the following scheme of confounding in the rows and columns 
for 36 and 100 treatments is suggested [106, 178]: 


Arrangement 


Pseudo-effect 
confounded with 


Rows 
Columns 


1 row differences and once 
in one of the arrangements. 


d once witl 


with column diff h is unconfounded 
ifferences, and each 1s c 6, 10, and АЕ 


Each of the three effects is confounde 


One method of constructing such designs 

and column numbers to the numbers in à k X k latin square such that the 

fret set of numbers refers to the А effect, the second to the B effect, and the 

third to the C effect. This is the method used to construct the arrangements 
f numbering is in the 


ìn table XTI-8. An added advantage of this system 0 П 
analysis of such designs оп punched card equipment. The various levels of each 


шч are already coded. The use of an z and a y in the 12 X 12 lattice square 
acilitates the coding for punched card analyses. 


Also, the first two digits, i and J, may be likened to the levels of effects in 


TABLE XII-8. Incomplete lattice square arrangements for 36, 100, and 144 


treatments 


6 x 6 incomplete lattice square 


Arrangement I 


Arrangement III 


000 130 220 330 4ho 
201 211 391 431 5%] 051 
202 312 №2 532 042 152 
505 413) 525 055 15 255 
hok 51b coh 13} 244 55% 


— — 


Arrangement II 


015 125 2355 345 


051 
101 
211 
321 
431 
541 


ola 
152 
202 
512 
loo 
532 


033 
143 
253 
303 
413 
525 


veh 015 
15% 125 
244} 255 
55h 345 
hok 155 


10 x 10 incomplete lattice square 


Arrangement I 


200 505 Hob 505 606 
211 52 35 61% 615 
220 321 №2 523 62h} 
259 330. №51 552 655 
28 349 ho бу 6р 
257 58 459 550 651 
266 367 468 569 660 
275 376 X77 5786 679 
284 385 486 587 688 
295  39& 95 596 697 
Arrangement II 
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TABLE XII-8. (continued) 


12 x 12 incomplete lattice square 


ox 
13x 
2% 
Nx 
= 
б9к 
19 
ex 
9ух 
xOx 
ylx 


a k X k factorial. This is the method followed in the palas к 
numerical example selected to illustrate the computations Гей schemes of 
Incomplete lattice square. It should be re-emphasized that nm e ans 
Confounding are available for values of Ё other than 6, 10, an f {о effects is 
ìt is possible to set up four arrangements such that ps our als 
confounded once with rows and once with columns, але 8 uon lattice 
ìn two replicates, The randomization procedure follows that ого 
Squares, Е K 

Example XIT-3. Uniformity trial data are available [340] үш mU eh 
from plots four hills wide by five hills long. Using these data, кы УОП 
twelve by fifteen hills (nine plots) were constructed. The variety numbers, 90%, %44, 


26 
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022, 101, 112, 120, 202, 210, and 221 are assigned to three rows and three columns 
in a group or arrangement in such a way that like numbers of the first subscript, 
say i, are in the same row and like numbers of the second subscript, say j, are in the 
same column in group 1; like numbers of the second subscript, j, are in the same row 
and of the third subscript, say A, in the same column in group II; and the like numbers 
of the third subscript are in the same row and like numbers of the first subscript are 
in the same column in group III. After constructing these arrangements the groups 
are randomly assigned to the replicates, and the rows are randomized and then the 


columns are randomized, thus retaining the above described arrangement for the three 
replicates. 


TABLE XII-9. 3 x 3 incomplete lattice square in 3 replicates (pounds 


of ear corn per 4 X 5 hill plot; variety numbers in pa- 
rentheses) 


79.1 


(202) (221) (220) 
27.0 25.6 21.5 


101 
с Gel от 


(сог) (011) 
27.8 26.9 


E 82.5 
[болеп [Г [шш 


82.3 


(120) (221) (022) 
27.9 27.7 50.9 86.5 
(ооо) (101) (202) 
25.6 26.7 28.3 80.6 
(210) (011) (112) 
28.5 27.8 26.4 82.7 


(ace) 
26.1 


(101) (221) (011) 
29.1 28.4 21.0 


(210) 
29.0 


Total over all replicates 


Another way of considering the arrangement in table XII-9 is to consider only the 
d s з ог the subscripts for the pseudo-factors a and b (this method works 
hi , equal to a power of a prime number). Following the notation of previous 
chapters, the К? treatments are designated as 00, 01, 02, 10, 11, 12, 20, 21, and 22- The 
pseudo-effects are А, B, AB, and AB? = AB*-1, The TA A and B are con- 
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founded wi i 

ори e "ei row and column differences, respectively, in replicate I; B and AB with 
cum E шап differences pee in replicate II; and АВ and A with row and 

В s, respectiv i i ini 
Ret ee noe рее у, in replicate III. The remaining two-factor inter- 
The vari базаи 
be ADU s os (in parentheses) and the plot yields are 
XII-9) for the k? = plots in each of the thi li 
don and replicate totals are given also. doped. 
S Ea two methods of analysis presented below (pseudo-effects; row and column 
ЕЙ = total, replicate, and variety (ignoring row and column effects) sums of 
e computed in the usual manner for a randomized complete block design. 


(i) Method of pseudo-effects 


For this method of analysis it i 

eas? of analysis it is necessary to construct a table of totals for the 

саза о effects A, B, AB, and АВ? for each of the replicates. The k = 3 treatment 

lest dm m. ae up the zero level of effect A, or (A)o, are 000, 011, and 022 (the 
Ё рї will be omitted hereafter) = 00, 01, and 02. - 

plots) in exea ) an The (A)o total (of k = 3 

(AJo = 27.6 + 26.9 + 27.8 = 82.3. 


Te totala fox the levels of all effects are presented for each replicate in table XII-10. 
The Е 5 levels of each effect (lower half of table XII-10) are explained later. 
Coli toti B of effects А and B in replicate I may be obtained from the row and 
levels of eff. 5; also, the row and column totals may be used to obtain the 0, 1, and 2 
effects B and AB, and AB and A in replicates II and III, respectively. 


TABLE XII-10. Totals for levels of each of the pseudo-effects A, B, AB, and AB* 
by replicate and weighted effects per level 


Replicate II 


.0 8. 


Weighted effects per level 


81.4069 
85.6560 
82.8911 
81.7555 


3(990.6668) = 2972-0004 


3 (Total) 


@-1) Sum of squares for row (eliminating yariety and ignoring column effects) 
ean of the effect confounded in rows with the 


The comparison, by levels, of the m 
cates yields the sum of squares for row (elimi- 


m : 
ean of the effect in the remaining repli 
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.nating variety and ignoring column effects). The sum of squares for effect А is 
[84.3 + 79.7 — 2(82.3)]? + [81.0 + 86.3 — 2(82.3)]° + [84.5 + 83.5 — 2(79.DT* 
6k — 18 
[249.8 + 249.5 — 2(243.7)]* d 

DES M UE 0190.7 
with k — 1 = 2 degrees of freedom. The pooled sum of squares for A, B, and AB- 
with 3(k — 1) = 6 degrees of freedom is 

3.1382 + 10.1226 + 22.1137 = 35.3745. 

(i-2) Sum of squares for column (eliminating both variety and row effects) 

The comparison of the level of an effect unconfounded with row or column differ- 
ences with the level-of the effect confounded with column differences yields the sum of 
squares for column (eliminating the row and variety effects). For example, effect Bis 
confounded with columns in replicate I and is unconfounded in replicate III; the result- 
ing sum of squares is 

[85.7 — 83.4]? + [80.4 — 81.8]? + [83.4 — 78.5]? [249.5 — 243.7}? 
2k = 6 2k? = 18 
= 3.3411, with (k — 1) = 2 degrees of freedom. 
The pooled sum of squares of this comparison of effects A, B, and AB is 
8.3700 + 3.3411 + 0.5678 = 12.2789, 
with 3(k — 1) — 6 degrees of freedom. 


(i-3) Sum of squares for column (eliminating variety and ignoring row effects) 


The comparison which yields the sum of squares for column (eliminating variety 
and ignoring row effects) is similar to that for row (eliminating variety and ignoring’ 
column effects). The sum of squares for the comparison of the level of an effect uncon- 
core with columns with the effect confounded with column differences for effect 

is 


[82.3 + 84.3 — 2(79.7)]2 + [82.3 + 81.0 — 2(86.3)]? + [79.1 + 84.5 — 2(83.3)]* 


192.9 T 84.5 — 2(79-7)]" + 182.3 + 81.0 — 2(86.3) j- + 179.1 + 64.0 — Airi 


6k — 18 
_ [243.7 + 249.8 — 2(249.5)]° 
з= = 1.1610, 


mo k — 1 = 2 degrees of freedom. The pooled sum of squares for effects A, B, and 
B is 

7.7670 + 6.6504 + 9.0004 = 23.4178, 
with 3(k — 1) = 6 degrees of freedom. 


(1-4) Sum of squares for row (eliminating both variety and column effects) 


The sum of squares for row (eliminating both variety and column effects) is similar 
to the sum of squares due to column (eliminating both variety and row effects). The 
sum of squares for the comparison of effect A confounded with rows in replicate Dans 
unconfounded in replicate IT is 

[84.3 — 82.3] + [81.0 — 82.3]? + [84.5 — 79.1]? [249.8 — 243.7]? 
2k=6 ame 18 
= 3.7411, with k — 1 = 2 degrees of freedom. 
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The pooled sum of squares for А, B, and AB is 
3.7411 + 6.8133 + 13.6811 = 24.2355, 


ш, total of 3(k — 1) = 6 degrees of freedom. 
e sum given in (i i tn (i : 
ES ron nS ae given in (1-1) and (1-2) should sum to those in (i-3) and (i-4) 
35.3745 + 12.2789 = 47.6534 and 
23.4178 + 24.2355 = 47.6533. 


m a partial check the sum of the values in the correction terms in either of the 
ums of squares given above should equal zero; for example, in (1-1), 


119 — 64 — 5.5 — 0. 


De intrablock error sum of squares may be obtained by subtracting the sums of 
both es for row (eliminating variety; ignoring column) and for column (eliminating 
variety and row) from the randomized block error sum of squares; thus: 


58.4926 — 35.3745 — 12.2189 — 10.8392, 


degrees of freedom (table XII-11). Also, 


with 16 — 6 — 6 = 4 = 2(k — 2)(k — 1) 
be computed directly as the interaction 


тоташса: error sum of squares may 

Е of levels of the effects and the replicates in which the effects are uncon- 

unconfound erus effect. unconfounded in more Шап one replicate 1s AB?, which is 

AB? with ed m all three replicates. The interaction sum of squares of levels of effect 
replicates is obtained from the following two-way table: 


Level of Replicate T 


effect AB? 

0 

1 

2 
Total 243.7 249.8 249.5 743.0 
79.62 + ... +80.7? 245 +: + 244.7? 243.2 Fo t 249.5? 
Li шы айы ami E а ааа 240.0 — 

k=3 3k = 9 B=9 
ш D. — 20,464.6600 — 20,451.1933 — 20,448.8867 + 20,446.2593 
f squares is equal, within 


Jee with 2(k —1) = 4 degrees of freedom. This sum o 
nding errors, to that obtained by subtraction (table XIII. . f 
is From the mean squares and the expectations of mean squares In table XIMI, it 
«Possible to obtain the various row, column, and intrablock weights used for adjusting 
E Variety means. The reciprocal of the expected intrablock variance 1s defined as the 
ount of information, w, for intrablock comparisons and is estimated by the recipro- 


са i 
of the intrablock error mean square, 
_ __1__ = 0.369031. (XII-38) 


1 
ш = Fy, 2.7098 


є 
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TABLE XII-11. Analysis of variance 


Source of variation Expectation 
of ms 


Replicate 2.6274 
Variety 
(ign. row and column) 11.€207 
Error (r. c. b.) 58.006 
Row (elim. variety; 
ign. column) 35.3745 
Column (elim. 
variety and row) 12.2189 
Column (elim. 
variety; ign. row) 23.4178 


Row (elim. variety 
and column) 


24.2355 


Intrablock 10.8392 


Likewise, the reciprocals of the expected row (¢2 -+ kc?) and column (0,2 + key?) 
variances are defined to be the amount of information obtained on row and column 
effects, ш, and w., respectively. They are estimated by 


4 1 i 
Ш, = oe = ragag = 0.186268, (XII-39) 
and 2E, — Е. 5.3686 
Ше SETTE 1 EST 3s = 0.722961. (XII-40) 


The amount of information obtained on row and column comparisons theoretically 
should be no greater than the information on within row and column comparisons. 
However, sample estimates of w, and ше are sometimes larger than w. Whenever this 
is true, the value that is larger is taken equal to ш. This results in zero corrections 10 
the mean for the row or column. In this example, w, is larger than w and therefore is 


set equal to w, resulting in zero adjustments for columns. Since this is an illustrative 


example, the rule will not be followed and the estimates of шу, we, and ш as found will 
be used. 


The levels of effect A are estimated with variance 1/w, in replicate I, 1/w in repli- 
cate П, and 1/w. in replicate III. Weighting a level of an effect. (table XII-10) 


inversely to the variance with which it is estimated, a weighted mean (of k plots) is 
obtained. The weighted mean 


(of k plots) for th le f table 
XII-10) is equal to r the one level of A (lower half o 


0.186268(82.3) + 0.369031 (81.0) + 0.722961(86.3 
0.369031 + 0.722961 + 0.186268 = w + = P pane: 
The remaining weighted levels of the € 
manner. The weighted levels of AB? 
cates, since the amount of informati 
Using the weighted levels of the 


ffects (table XII-10) are obtained in a similar 
are obtained as the average from the three repli- 
on in each replicate is equal to w. 

effects from table XII-10 and the designated levels 
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in table Шыр the weighted mean for, say, variety 01 = 011 is 
(са (в. (АВЕ (ав) 1) — ki ŒI) 


1 
= 3( 81.4069 + 81.5270 + 81.6078 + 81.5667) — 3089) 26.15. 


The remaining adjusted means (table XII-12) are obtained similarly. 
TABLE XII-12. Table for obtaining levels of an effect to use in adjusting variety means, 
adjusted and unadjusted variety means 


Treatment 
Or variety level of effect 1 ss 
Combination (л), (в), (АВ), y (57), 4 әд Adjusted а 


о ооо ннн ооо 
оро MRO оно 
кою ONH NRO 

о jorn nor pno 


(i) Method of row and column totals 
us ache method of analysis for a k X k design constructed inan l 
ii bed earlier, may be generalized for k — any integer by making use of the row 
column totals in the three replicates. In order to do this, it is advantageous to 
Construct a table similar to table XII-14. It is not necessary to systematize the variety 
‘ae for each plot, but this may as well be done if the yields are recopied. Otherwise, 
е additional computations in table XII-14 could be included in table XII-9. 


incomplete lattice square, 


used to construct replicates I, 
heses) 


Replicate II Vg* 
Total 


202) 
(00) (0) О, 249.7 


o) (ои) (12) 
Qu) тул 80.8 


T. 
ABLE XII.13, Variety totals in the 3 arrangements 
II, and III (Variety numbers in parent 


(011) 
85.1 79.1 85.5 


(201 120) 
"oL 2.9 


81.6 83.2] 249.6 ЛИР 
2 
(209) (210) (221 | (120) (221 
m e (е) 247.1 || 84.2 80.7 83.5] 218. 
Ema [sme eer mer тә] 
= Replicate II Ге = 
Total 


(210) 
85.0 
e dw € 


120 
(20) 252.9 


23.0 


202 
е) qu GE aer 


249.6 247.1 743.0 


406 Lattice Designs with More Than One Restriction [§ XII-2.3.4 


The standard error of a difference between two adjusted means for a pair of varie- 
ties which appear together in a row and in a column (e.g., 000 and 011) is 


2 2 к—Фү_ [2 к. 
A eee + ==) =4 /202(0.782313) + 0.903266] = 1.28.(XII-46) 


The standard error of a difference between two adjusted means for a pair of varieties 
which do not appear together in a row and in a column (e.g., 000 and 112) is 


p 3 I-A p 
А ru Ее xS) E Nm = 135.  (XII-47) 


Ordinarily the latter standard error will be larger than the former one, since the 
comparisons within rows and columns are more precise than between rows and col- 
umns. The above discrepancy is caused by the fact that w, is larger than w; it should 
have been set equal to w for this case, but was not because it is desired to illustrate 
the computational procedure. The average standard error of a mean difference between 
any two adjusted variety means is 


2 3 k—2 2 c 
АЕ" (les WE FERA Зр ) NE + 0.903266] = 1.27.(Х11-48) 


As a partial check the value of the last standard error should fall between the former 
iwo. 


XII-2.3.4 Incomplete lattice square іп 3q replicates. The arrange- 
ments in table XII-8 may be repeated to obtain 6, 9, ··:, 3q replicates of 
an incomplete lattice square design. A different randomization is utilized for 
each set, as described earlier. The general form of the analysis of variance is 


Mean square 


Source of variation df 
Observed ^ Expected value 

Replicate За —1 Ds ч 
"Treatment (ignoring row and 

column) Sc x т 
Row (eliminating treatment; 

ignoring column) 3(k — 1) == E 
Column (eliminating treatment 

and row) 3k—1) - 
Column X replicate in which } E. 09е + (29 — 1)koy*/24 

effects are confounded 3(q — 1)(k — 1) 


Row x кыс in which effects 
are confounded = = 
Row (elim. treatment & col.) s sa 1) rf j RUN Mere DE 


Column (elim. treat.: i 
Intrablock error ыгал 20 


The interaction of TOWS and replicates and of columns and replicates sums of 


eid P odis as described previously for the component (a) sum of 
+ “ne expectations of the two mean s ot 
+ Res? respectively, ean squares are oê + ko? and ce 
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Example XII-4. If an additional randomization of the three arrangements in table 
XII-9 is used for a second set of three replicates, the analysis follows that given in 
example XII-3 except for a few additional calculations. The computational procedure 
for nine replicates or three sets of the original three incomplete lattice squares foilows 
directly from that given below. 

Uniformity trial data on the pounds of ear corn from a 4 X 5 hill plot were used 
to construct the additional set of replicates (table XII-16). The arrangement is the 
Same in replicates I and VI, II and IV, and in III and V. In replicates I and VI the 
first subscript remains the same throughout a row, the second subscript remains the 
same throughout a column, and the third subscript varies from 0, 1, to 2 = k — 1 
within a row and within a column. In terms of pseudo-effects, the levels of effect A 
A confounded with row differences and of effect B with column differences. The 
eae and AB? are unconfounded with row or column differences in replicates 


TABLE XII-16. Second set of 3 replicates of a 3 X 3 incomplete lattice square (see table 
XIL-9 for the first З replicates) Pounds of ear corn per 4 X 5 hill plot 


(Genet Replicate IV Replicate V 
asic arrangement as replicate II) (Same basic arrangement as replicate тїї) 


ON ELSE 
total total 


(210) (212) (оп) (222) (202) (сег) 
29.0 2.5 27.5 27.9 24.5 26.1 


(120) (c2) (221) (220) (210) (000) 
98.5 27.9. 2755 26.4% 25.4 25.2 


(ооо 
t Ium 


> “4h 28.1 19.9 
Ol. total 85.7 77.8 ё2.9 
Replicate VI 


(Same basic arrangement as replicate I) 


Row 
total 


(201) (221) (011) 
23.1 28.7 26.5 
Col. totel 77.4 


78.6 17.7 


(œ) (221) 

23.8 24.7 72.3 
(œ2 

Qi e 74.1 


75.5 


(01) (2120) 
24.8 25.6 


Coi. total (69.5 75:0 17.6 


VE Sums and differences necessary to complete the analysis of „variance (table 
-19) for two sets of the 3 X 3 incomplete lattice square аге given in tables XII-16 


t ^ 5 
o XII-18. The various sums of squares should present no difficulty, but if they do, 


co à 
mplete details are given in reference 106. 
he weights are 


w = 1/E. = .354258, (XII-49) 
and w, = 3/AE, — Е.) = -509200, (XII-50) 
w, = 3/(4Ec — Е.) = .341613. (XII-51) 


TABLE XII-17. Form for calculations 


Colum total 


Rep. III and V 
CA arrangement 
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TABLE XII-18. Row and column totals by replicate 


Rep. II Rep. IV 


Rep. II — Rep. IV 
| 


82.0 85.7 
82.2 8.9 
85.6 77.8 


83.4 73.0 156.4 
81.8 69.5 151.1 
77.6 156.1 


о 
1 
Em 
[5.6 [mar | 89.8 — 22 


4ob.2 


79.7 и 
86.3 TI 
83.5 78.6 


T. 
ABLE XII-19. Analysis of variance 


R lee O Тез E 
otal 
Co; 53 200.4081 


Replicate 

Yi 75.6525 

ЕВУ (ign. rov ana column) 3 19.3814 
= (т. e. v.) ho 107.3742 
ow (elim. variety; 1 у 

gn. column) 6 11.4750. 

Colum (clin. variety and rov) 6 8.5362 
т umn x replicate 6 25.8189 
Intrabiock 16 15.1652 
= 6 16.3789 


X replicate 
Ноу (elim vi T 
+ variety and column) 9. 16h 2.1786 = E 
Column (elim, variety; ign. row) 10.2467 = = 


Si 
ince w, > w, row adjustments should not be made, and Æ, should be set equal to 

As However, in order to illustrate the procedure, this is not done in the present 
ample, The weighting factors for obtaining adjustments for the means are 


= —.0071073, (XII-52) 


w — Wr 
3gk(w + w, + we) 
0 Ue -— 9903021, XII-53 
and Sgk(w + w, + we) i $ | 


Wr — W, 
L——— = = ——. —..0074094. XII-54 
F 3gk(w + w, + we) 4 
"hu this example the alternative procedure is followed in obtaining the adjustments 
the various means. The values V; — 3R; are each multiplied by equation (XII-52) 


410 Lattice Designs with More Than One Restriction [§ XII-2.3.4 


and the values V; — 3C; by equation (XII-53). Two adjustments are obtained for a 
variety mean in each arrangement. Therefore, the total number of adjustments is six. 
The adjusted mean is equal to the unadjusted mean plus the sum of the corresponding 
six adjustments (table XII-20). For example, the adjusted mean for variety 022 is 


ue — .0746 + .0068 + .1400 — .0042 — .0498 + .0023 = 27.49. 


TABLE XII-20. Unadjusted and adjusted variety means 


Unadjusted 
Mean 


The standard error of a difference between two adjusted means for a pair of varie- 
ties appearing in the same row and column is 


2 2 —2 
Sis байке! ао) 38 ОШ) 


The standard error of a mean d 
the same row and column is 


[2 3 k—3 
ЫН == ШӘ] on IS -56 
Bnei Lus 535 ) 91. (XII-56) 


The average standard error of a difference for any pair of adjusted means is 


ifference between a pair of varieties not appearing in 


2 3 ES 
Уд TD ш Еш m)-9 (XII-57) 


The average effective error variance is 


ut 3 k—2 
k+ е +) = 256. (XII-58) 
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XII-3 E Treatments in k Whole Plots, k Split Plots, 
and Е Split Split Plots, A Two-Restrictional 
Design 


Аз experimental conditions warrant, the following scheme of confounding 
may be utilized: 


Arrangement Whole plots Split plots 
I А B and А X B interactions 
п B CandB XC E 
ш с AandA XC 


The above scheme of confounding is similar to that used in a cubic lattice. 
The only difference is that the pseudo-main effects are confounded with in- 
complete blocks of size k? instead of k. The randomization procedure differs 
In that the k levels of a pseudo-main effect are randomly allotted to the whole 
Plots of size Ё?, Then, the levels of the second pseudo-main effect are randomly 
allotted to the k incomplete blocks of size k within each whole plot. Finally, 
the treatments within a split plot are randomly allotted to the split split 
Plots (the experimental units) within each split plot. (The randomization 
Procedure in each replicate is the same as for a split split plot design.) 

The sums of squares for the analysis of variance are computed in the same 
manner аз for a cubic lattice. The expectations of some mean squares are 


different from those for the cubic lattice; thus: 


GNE ee E 2510. 


Source of variation af ETE 
Replicate 2 = 
Treatment (ignoring whole and 

split plot effects) nt Дз i 
Fi len 3k — 1) od + kop? + К?се31/2. 
mponent (b) 3(k — 1) oe? + kog?/3 + k*o41/3 
Some, 3(k — 1)? се + 2kog?/3 
Intrablock 2p Sp z 


The true weights are 


ау (XII-59) 
and ш, = 1/(c2 + kog), (X1I-60) 
(XII-61) 


шь = l/(c& + kog + Ko). 


The average variance of a difference between two adjusted means is 


2 3 aes D) 11-62) 
ЕТ КЕЕ; 2w + Ws ш C 


27 
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In using such a design the experimenter should first determine the number 
of degrees of freedom associated with the various mean squares. If fewer than 
12 to 14 are available, one may be well advised to use additional replicates or 
to use a one-restrictional lattice design instead of the present two-restrictional 
design, unless the amount of variation removed by whole plots is considerable. 


XII-4 Three-Dimensional Three-Restrictional Designs 


Various arrangements of k? treatments with three restrictions on the allo- 


cation of treatments within a complete block are available. Two of these 
schemes of confounding of the pseudo-effects are 


i k? whole plots 
I — ——— ———————. 
UR Rows Columns Neto pim 
a ЕСЕБШ DSi —————9 
Replicate I A B All A X B interactions 
P п в с AlB XC x 
* ш с A АПА xX C я 
k? split plots 
Scheme 11 k whole plots Rows Cannas 
Replicate I A BC &A X BC BC? & A x BC? 
ci II B AC? & B X AC? AC & B X AC 
Ч ш (9 A&AXC B&BxC 


For 3? treatments the average error variance for Scheme I in 3q replicates 
is 


22501 CLA - 6 4 
ies +w + w am W, + 2ш 31 ) (XII-63) 
and for Scheme II is equal to 


2 ( 1 % 1 AN 
137 шу + w, + w’ We! E wj E" "má "E Wy = Qw’ / 


Ш ааа 1. 2. 

+ W Pac + SURE! xs 507 + =] (XII-64) 
pui [104] and Kempthorne [175, sec. 24.5] discuss Scheme I; Yates 
: ше design for Scheme II. Also, Kempthorne [175, sec. 24.5] 
teats a variation of Scheme II. A variation of Scheme I is to use kp? treat- 
ments in a p X p lattice square with split plots of k treatments. In addition: 
confounding in the split plots might be used [171]. 
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XII-5 Four-Dimensional Lattices with 
'Two or More Restrictions 


Designs for kt treatments may have two, three, or four restrictions placed 
on the allotment of the treatments within the complete block. Lattice square 
designs of k? rows and k? columns may be used. Such a design may be analyzed 
as an ordinary lattice square. Also, one could use two-restrictional designs of 
the following nature: 


Number of 
Pon Whole plots Split plots Split split plots 
— PEE 
1 k? k k 
2 k k? к 
3 k k k 
an So oao -Kou ee —— ——— —— 


The following three-restrictional designs are available for kt treatments: 


Number of 
Design 7 Split split Split split 
Whole plots Split plots lots split plots 
р! 
ЗИ rose 1 
: k k k k 
2 k? k X k lattice sq. = 
3 k X k lattice sq. k? 


A four-restrictional design that may be useful for some experimental 
Conditions is a k X k lattice square of the whole plots with the split plots in 


each whole plot arranged as a k X k lattice square. 


XII-6 Missing Data 


Cornish [67, 69] has developed missing plot formulae for the semi-balanced 
and balanced lattice squares; his formulae (without recovery of interblock 
Information) for these designs are 
Ху = Hb = (В + С) — (6-3), + 2k = ®Х + OX... В+ V) 

(k—3)k — 1) (XII-65) 


and 
Xen k(k — 1)(R + C) — (k + DX, + k(k — DX. 3X... — k(B + y) 
(k= 2) — DF (X1I-66) 


respectively, В, C, X,.., and X.;; represent the row, column, replicate, and 
Teatment totals, рб that contain the missing observation; X... is 
€ total of all rk? — 1 observations; В is the sum of row and column totals in 
© other squares which contain the treatment whose value is missing in R 
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and C (i.e., B is the sum of the row and column totals containing the treat- 
ment); V is the sum of the 2(k — 1) treatment totals for the treatments 
making up R and C. For each missing plot value computed, one degree of 
freedom is subtracted from the intrablock error degrees of freedom. Otherwise, 
after inserting the computed values, the analysis is completed in the same 
manner as for no missing values. 

Missing plot formulae for most other designs discussed herein and for 
other situations have not been developed to date. The methods described by 
Cornish [67, 69] may be applied to obtain missing formulae for these designs. 


XII-7 Tests of Significance 


Tests of significance of the adjusted treatment means for the designs in 
this chapter are in a less advanced state than for those in the preceding 


chapter. It is recommended that the following test be used to approximate 
the correct test: 


F (treatment and intrablock error df) — iis among adj usted treatment meats) 
average effective error variance 


(XII-67) 
The above test was suggested by Bliss and Dearborn [25] for a semi-balanced 
lattice square. With such a test, it is possible to partition the treatment 
degrees of freedom and to test the resulting mean squares. 

An alternative test would be to obtain the treatment (eliminating both 
row and column effects) mean square and compare it with the intrablock error 
mean square. Such a test is exact if the ordinary assumptions of additivity 
and normality hold, but it is inefficient, since the interrow and intercolumn 


information is ignored. Also, contrasts among groups of treatments are much 
more difficult to obtain by this method. 


CHAPTER XIII 


Other Incomplete Block Designs 


XIII-1 Introduction 


In addition to the designs discussed in Chapters IX through XII, there 
аге other incomplete block designs which merit discussion because of their 
usefulness in certain experimental situations. The partially balanced incom- 
plete block design and the balanced incomplete block design are two useful 
designs for controlling the variation among the b incomplete blocks of size Ё 
each, with the treatments not in complete replicates. Also, incomplete block 
designs with two-way elimination of variation may be used to control two 
Sources of variability even if the treatments are not in complete replicates. 
In still other experimental situations a variation of the split plot design may 
be used to obtain more accurate comparisons on the treatments within a 
Natural subgroup than on the treatments in different natural subgroups. 


ХШ-2 Incomplete Block Designs for v Treatments in 
b Incomplete Blocks of Size k 


. The designs in the present section were first proposed by Yates [320, 322] 
11 1936. He constructed several of the designs and suggested a large number of 
others, Since 1936, many useful designs of this type have been discovered 
and constructed [26, 27, 28, 30, 32, 175, 176, 226, 227, 255, 324, 326, 327, see 113 
for additional references]. Two basic types of incomplete blocks are discussed 
11 this section; i.e., the balanced incomplete block (b.i.b.) and the partially 

alanced incomplete block (p.b.i.b.) designs for v treatments in b incomplete 

locks of size k with each treatment repeated г times. 

he randomization procedure for these designs follows: 


(i) Allot the entry numbers to the treatments at random unless there is a specific 


Gi) A for not doing so. 3 
:Ш) Allot the groups to the b blocks at random. , AY: i 
ш) Randomly int the treatments to the k experimental units within each block. 


In the event that it is possible and desirable to arrange the treatments 
E Complete replicates (these designs are called resolvable incomplete block 
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designs by Bose and Nair [30]), the randomization procedure is the same as 
described in Chapter XI for any one-restrictional lattice design. 

The analysis for resolvable incomplete block designs is the same as the 
analysis for the b.i.b. and p.b.i.b. designs described in the following two sec- 
tions except that the replicate sum of squares is removed from the block 
(eliminating treatment) sum of squares. The expectation of E; then becomes 
a? + k(r — 1) с? /т instead of e? + (bk — v)eg!/(b — 1), as given below. 


XIII-2.4. BALANCED INCOMPLETE BLOCK DESIGNS 


XIII-2.1.1 One set. The particular lattice design composed of v 
treatments in b incomplete blocks of size k with the treatments not arranged 
in complete replicates is often referred to as the balanced incomplete block 
design [30, 60, 141, 175, 255, 320, 322] or as the symmetrical incomplete 
randomized block design [320]. This class of designs is useful for experimental 
situations requiring control of the variability only among the b incomplete 
blocks. 

For » treatments in b(b > v) incomplete blocks of size k(k < v) with r 
replicates on each treatment the condition of balance is fulfilled if each pair 
of treatments occurs together in the incomplete blocks an equal number, X 
of times, Also, the following conditions must hold: 


, 


or = kb. (XIII-1) 
(v — 1 = r(k — 1). (ХПІ-2) 


The efficiency factor, Eff, is defined [327] as the fraction of total information 
contained in intrablock comparisons when interblock and intrablock contrasts 
are of equal accuracy. The efficiency factor is equal to 
КОШЕЛЕК (0—1) uA 

E I—IATE UN 


The analysis of variance table for the balanced incomplete block design is 
327]: 


ne HO MEN C 


(ХІШ-3) 


Mean square 


Source of variation df ss Observed Expected value 
"Treatment (ignoring block) =l Т 
Block (elim. t; tment b г і By Y “a 
Block (elin treatment) " e a5 2 pı E сё + (kb P uii 
DENEN —.————— 3 — 
Treatment (elim. block: 
yee (ignoring treat.) | b E j ai = % a 
ntrablock kb-v—b241 E Е, oe — 
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where 
MX. AX 
m M r Е = то (XIII) 
- ERE 
B, = ss among blocks — | $ XSB)2— SOT /k(r — X) 
71 
+ У /гу( — k)(k — 1), (ХПІ-5) 
7-1 
» v р ° 
total ss = >, 9 X, — =, (XIII-6) 
tel j=l 


and E is obtained by subtracting the sums of squares in formulae (XIII-4) 
and (XIII-5) from the sum of squares obtained in equation (XIII-6); X.; 
= treatment total; X.. = total of all rv = bk observations; (SB); = total 
of all blocks in which treatment j appears (see example XI-4); and 


W; = (w — E)X.; — (0 — 1) (SB); + (k — DX... (XIII-7) 


Also, 

т, = Y0//Er(Ef)) = X-0//bus (хш) 

jel 
and В» = ss among b block totals 
Б ВЫ — X.3n (XIIL9) 

Where , 

О; = kX.; — (SB); (XIII-10) 
and 


В., = total of uth incomplete block. (XIII-11) 


In the event that E, < Е, the analysis for the completely randomized 
design is used, and the treatment means, 2; = Ж.у/г, are not adjusted. If the 
number of degrees of freedom associated with Ey is less than 12 and E» > E, 
ìt is recommended that interblock information be ignored and the treatment 
means be adjusted only for intrablock information. If, on the other hand, 
Es > E, and the number of degrees of freedom associated with E, is greater 
than 12 to 14, it is recommended that both interblock and intrablock informa- 
Чоп be utilized. 

The adjusted treatment mean utilizing only 
equal to the experimental mean, 2, plus а correction; thus: 


3 à/'— Z + Q,/rk(Eff) = 2 + (» — 1)Q;/ro(k — 1) 
With an average variance of a difference between two means of 


2E, k(o — 1) Y _ 2Е. v—k ) : 
KE = FA rok = 2) d cce me esi E C) 


ly intrablock information is 


(XIII-12) 
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The variance of a difference between two adjusted means is obtained by multi- 
plying by two. If interblock information is recovered, the adjusted treatment 
mean is equal to the unadjusted treatment mean plus a correction; thus: 


£j = 2; + uW;/r (XIII-14) 
with an average variance of a difference between two means of 
2k(v — 1) _ 2k(v — 1и PE 
rwwk — 1) F rwv — k) ^ r(w — w)’ (XIII-15) 
where 
ШЕ, (XIII-16) 
7m u(r — 1) i 
D T KE-—DE — @= BE: (XIIL17) 
and 
w — w 


~ wok — 1) + wo E) 
D (b — 1)(ЕЁ„— E.) 
202-006 EF 0- kb— Е, (XIIL18) 

The average effective error variance is r/2 times the variance of a difference 
for mean (see formulae (XIII-13) and (XIII-15)). The efficiency of the incom- 
plete block design relative to the completely randomized design is the ratio 
of the two average effective error variances for the two designs. The coefficient 
of variation is the square root of the average effective variance divided by 
the mean of the experiment, 2. 

In the special case where » = b and г = Ё, the above formulae may be 
simplified [60]. Some modifications of the formulae are necessary if the treat- 
ments are arranged in complete replicates or in groups of replicates. Yates 
[327] and others [30; 60, Ch. 11; 175, Ch. 26; 255] present formulae and dis- 
cuss the calculational procedure for these cases. 

It should be noted that the sum of squares for block (eliminating treat- 


ment effects), equation (XIII-5), may be obtained by another procedure; 
thus: 


block (ignoring treatment) ss 

— treatment (ignoring block) ss 

+ treatment (eliminating block) ss 

= block (eliminating treatment) ss. (XIII-19) 


Also, it should be noted that when v — b, formula (XIII-5) reduces to 


LW?/ro(v — k)(k — 1), (XIII-20) 


j=l 


since the sum of Squares amon i 
€ the block totals, B.., is equal to the sum 0 
Squares among the totals of blocks, (SB); k 


with other lattice designs, tests of significance have not been fully 
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developed to date. The same approximate tests described previously are 
applicable here. When interblock information is not recovered, the F test, 


FU Лапа 126 >= treatment cm block) mS (XTII-21) 


e 


follows the F distribution if the error deviations are independently and 
normally distributed with mean zero and variance с. When interblock in- 
formation is recovered, an approximate F test is 


Бо = Tandika — ms among adjusted treatments. 99у: 
алаша кыраш) average effective error variance CUIE-22) 


Individual comparisons among means or groups of means are easily obtained 
by the above formula. 

Cornish [64] presents a formula for obtaining the intrablock estimate for 
à missing value in a balanced incomplete block design. The formula is 


x, = wk — DBs + klv — 1)Q — (›—1)0/ 
ла k(k — 1)(bk —b -v+ 1) , 


where the Jth treatment in the uth block is missing; the uth block total is 
B.u; Q; is the value of kX.; — (SB); for the jth treatment; and Q;' is the sum 
of the Q values for all other treatments in the uth block. If more than one 
value is missing, a guess-estimate is inserted for all missing values but one; 
formula (XIII-23) is used to compute the value for the remaining missing 
Plot. The computed value is inserted for the missing experimental unit, 
formula (XIII-23) is used to compute the value for a second missing value, 
and the process is continued until the computed values stabilize. As usual, 
One degree of freedom is deducted from the intrablock error for each missing 
Value computed, If several values are missing, it may be advisable to analyze 
the experiment as a completely randomized design. Also, one may introduce 
Pseudo-variates, composed of zeros and ones, for the missing values and per- 
form an analysis of covariance [255]. 

The methods for constructing b.i.b. designs have been described by Yates 
[322] and Kempthorne [175, Ch. 26]. A number of plans for various values 
Of v, b, k, and r have been presented [26, 60; 175, Ch. 12; 322]. 


(X1II-23) 


Example XIII-1. Hanson et al. [141] compared ten methods of preparing dried 
°88 samples. A panel of seven judges tasted each sample. The average scores for each 
Sample, the panel means, are given in table XIII-1. The 10 = v treatments were set 
Up in b = 15 incomplete blocks of Ё = 4 treatments each. Every pair of samples oc- 
curred together А = 2 times in the incomplete blocks. The number of replicates on 
any particular treatment was r = 6. The ten treatments consisted of two controls and 
a 23 factorial, Treatment 1 was the control for treatments 7 to 10, and treatment 2 
Was the control for treatments 3 to 6. Treatments 3 to 10 formed a 2 X 2 X 2 factorial 
Combination of powders, moisture contents, and methods of packing. The present 
analysis considers the ten treatments as a single set of treatments. The partition of 
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treatment degrees of freedom into single degree of freedom comparisons is left as an 
exercise for the reader. Precaution was taken to insure that color differences did not 
affect the judges' scores. Sodium vapor lights were used to mask any color differences; 
the dried egg samples were reconstituted and scrambled before presenting the samples 
to the judges [141]. 


TABLE XIII-1. Panel means of scores of 10 dried egg samples (score of 10 = no off-flavor 
and zero — extreme off-flavor) 


Incomplete Treatment number 
block (period) 


1 8.7 - 5-4 5.0 - - - 5 " 

2 9.6 8.8 `- Me o = qm 
H ООА Чо! 95-8) dS x1 - 

4 93 - 87 - 68 - 3.8 - = - 

5 ООК о О 2 TED mE n БОЛ 
6 = 96 - - - ИБО “46, 15.61. e 

if 2 T- = - Th = - kd c v 
8 - - - КОНУ 6:31 2. 5.1. 20 
9 9.5 9.5 8.2 - - - - 525 - 
10 = Е - 8.1 9.0 6.0 - - 33 E 
1 97 - - - - 6.7 6.6 - - 2.8 
12 - = 955 01801 (= - - r1853:1552:6 
15 9.8 - - - т 1s} - 5.4 h.o = 
14 - - - 5.8 = ра 


= 9.0 8.3 8 . 
3 9.5 - 8.3 6.5 = 5. 


15 8 i 
57.8 56.0 55.5 45.3 45.9 35.7 30.9 26.2 23.0 17.6 391.7 
061 ӨЗҮ 8,9) ТО ТӨ бо 52 1828/07318; 2.9 
Range ӨЙ, mg 155m 35:5 3 1:6 3,8 1,8 Ге А 


TABLE XIII-2. Calculation of adjusted treatment means 


Treatment| Treatment |Sum of blocks|Q, = kX 
number |total = X containing a ü Adjusted mean 
Е) treatment j 
= (В), 


Беләсе 


1 
2 
3 
4 
2 
6 
T 
8 


ow 
TAE 
n= 
iv) 


П 
су 
E 
WO Oxo У o io ini io 


TO V4 фл 1 4 со 
Su 559899379 


591.70 =X 


К ү шшш the analysis of variance, it should be noted that treatments with 
(table ХШ). E es tend to have a larger range than do the remaining treatment? 
: However, the ranges of the deviations for a treatment rather than the 


гап 

ү, ie о Scores for the treatments should be considered. Since the rango 
à ompared to the differences among the treatment means and since they an 

not exceedingly discrepant, Separate anal 


e 
увез for the more variable treatments 27 
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not computed. Also, no transformation of the data is made prior to analysis, although 
an arcsine transformation may be appropriate. 
The various sums of squares are (see tables XIII-1 and XIII-2 for totals) 
Trealment (ignoring block) with 9 df (formula (XIII-4)): 
2 see 2 2 

ЭТФ + EITS _ SET = 2071.86 — 2557.15 = 314.71, 

Block (ignoring treatment) with 14 df (formula (XIII-9)): 
2 see 2 

mett PLT — SAT L 2574.22 — 2557.15 = 17.07. 

Treatment (eliminating block) with 9 df (formula (XIII-8)): 
66.9? + +++ + (—61.0)? + (—79.4)? 
= 321.51. 
kon = 4(10)(2) = 80 
Block (eliminating treatment) with 14 df (formulae (XIII-5) or (XIII-19)): 
164.3? + +++ + 149.8? — 1566.82/10 43,22 + +++ + (—67.5)? 
17.07 — 
p | 16 |+ 101000980 = оті 
= 17.07 — 15.69 + 22.50 = 23.88, 
ог à 
17.07 — (314.71 — 321.51) — 23.87. 
Inirablock error with 36 df (by subtraction): 
356.46 — 314.71 — 23.88 = 17.87. 


The weights are estimated to be 
w = 1/.496 = 2.016; 
106 =) = 0.5402 
f= eee AOR " . 
w =s — 1)0.706) — (10 — 4)(496) 
The weighting factor is equal to 
m 2.016 — 0.5402 EC. 1.4758 — 0.02316. 
4 = 5016010704 — 1) F 0.540200 — 4) 63.7212 
ation, is obtained by adding uW; to 
e, the adjusted total for treatment 2 
d totals are obtained in a similar 


The adjusted total, recovering interblock inform 
the unadjusted treatment total, X.j. For exampl 
18 56.0 + (.02316)(78.3) = 57.81. The other adjuste 
manner, 
TABLE XIII-3. Analysis of variance 
Source of variation 
Total uncorrected 


Treatment (ignoring block) 
Within treatments 


Block (eliminating treatment) 
Intrablock 
Treatment (eliminating block) 
Block (ignoring treatment) 
Intrablock 
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The efficiency of this design relative to the corresponding completely randomized 
design is 
0.835(100) 83.5 
4(9)(.02316)/(1.4758) ` 5650 - 149 Per cent. 


The coefficient of variation is (100) 4/.5650/6.53 — 11.5 per cent. The standard error 
of a difference between two adjusted means is equal to 4/2(.5650)/6 — 0.434 scoring 
units. 

XIII-2.1.2 More than one basic set. It may happen that more than 
one set of the balanced lattice design is required to obtain sufficient replica- 
tion on the treatments. For example, suppose that it is desired to compare 
? — 13 sweet potato varieties for quality, that treatment differences are small 
compared to the error variance, and that each taster or judge compares only 
k = 4 varieties at one time. For a b.i.b. design with b = 13 incomplete blocks, 
four replicates are required. It may be necessary to use sixteen replicates in 
order to detect differences of the order of magnitude likely to be encountered. 
If each set of four replicates is completed prior to beginning the next set, the 
appropriate analysis of variance is given by Yates [327] and others [60, Ch. 
11; 175, Ch. 26]. If, on the other hand, the fifty-two tasters are randomly 
allotted to the fifty-two incomplete blocks and if there is no valid reason for 
grouping by replicates, the analysis of variance for g sets of the basic design 
is of the form: 


Mean square 


Source of variation df 
Observed Expected value 

Treatment (ignoring block) у = 1 — — 

Block (eliminating treatment) qb—1 Es o + (qbk — v)eg?/(qb — D 
Component (b) b —41 с + (bk — v)eg?/(b — 1) 
Component (a) b(q — 1) o + Кез? 

Intrablock qbk—qb—v-41 E. сд 

"Total qbk = 1 = = 


(A— a T — d m == 


The standard error of a difference between any two adjusted means is 


2k(v — 1) 24) 
Учай — D) + w'(v — Ю)] MS 


h 
where w= 1/E, III-25) 
and $ x 
pe kbq — v . 111-26) 
К — Е — (v — ЮЕ, GE 
Drs. G. A. Joh 


annessen and J. D. Hartman, 


a 
ment of this type, Cornell University, conducted an exP ен 
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The weighting factor is equal to 
w — ш! 
^T 1) шо 0 ee 
The adjusted treatment means are computed from formula (XIII-14), with 
г replaced by rq in the denominator of the correction factor. 


XIII-2. PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS 


As is obvious from formulae (XIII-1) and (XIII-2), large values of r are 
obtained for certain values of b, k, and v. In order to obtain incomplete block 
designs with smaller numbers of replicates, it is necessary to use incomplete 
block designs which are partially balanced [27, 30, 32, 175, 226, 320, 322, 
324, 326]. Examples of resolvable p.b.i.b. designs (e.g., double, triple, cubic, 
and simple rectangular lattices) have already been discussed. P.b.i.b. designs 
With the treatments not in complete replicates are presented below. 

Bose and Nair [30] give the following specifications for a p.b.i.b. design 
[32; 175, Ch. 27; 255]: 


(i) The v treatments with г replicates per treatment are arranged in b incomplete 
blocks of k treatments each. 

(ii) With regard to a particular treatment j, the remaining (р — 1) treatments 
fall into m sets, the gth set of which occurs in №, blocks with treatment j and 
contains n, treatments; the numbers п, are the same regardless of the treat- 
ment specified. 
m 

UP TC (XIII-28) 

g-1 

m 

2n, = r(k — 1). (XIII-29) 
с=1 

(iv) If we denote the treatments occurring in an incomplete block А, times with 
a specified treatment j as the gth associates of j and if we denote the treat- 
ments occurring №,/ times with a treatment j’(# Лаз the fth associates ofj’, 
where j and j’ are hth associates, then the number of tremenis in common 
between the gth associates of j and the fth associates of j' is Poss this number 
is the same for any pair of hth associates. 


(v) Pos” = prot. (XIII-30) 
пруу* = поруһо = пуро (XIII-31) 
по — l, fork =r | 
Урџћ = | (XIII-32) 
д Ng fork #r). 


Bose and Nair denote г, v, b, k, А №, ***s Ams Ni, Na, ***, Nm 88 parameters 
Of the first kind and the Pos?” as parameters of the second kind. 
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The methods of constructing p.b.i.b. designs are not discussed herein.! 
The analysis for m = 2 is illustrated with an example. For the analysis of a 
p.b.i.b. design with m — 3, the reader is referred to Chapter XI and to the 
paper by Nair [221]. Nair also discusses the analysis for the simple rectangu- 
lar lattice which is an m — 4 associate class design. 

If E, < E., Rao [255] recommends that the experiment be analyzed as a 
completely randomized design. When less than 12 degrees of freedom are 
associated with Æ, and when E, > E, it is suggested that recovery of inter- 
block information be dispensed with and only intrablock adjustments be made. 

Rather than repeat the basic design, it is recommended that attempts be 
made to balance the design or to approach a balanced design in order to 
approach equal precision on treatment comparisons. If this is undesirable, 
then the basic plan of the p.b.ib. design with a different randomization 
could be repeated. The form of the analysis follows that described in the 
previous section. For a more complete discussion of these designs the reader 
is referred to Bose and Nair [30], Kempthorne [175, Ch. 21], Nair [221], and 
Rao [255]. The analysis for the p.b.i.b. design given in example XIII-2 closely 
parallels the discussions and examples given by these authors. 


Ezample XIII-2. An example of a partially balanced incomplete block design 
was constructed from the data presented in table XIII-1. All incomplete blocks con- 
taining treatment 1 were deleted. This procedure of constructing a partially balanced 
incomplete block design from a balanced incomplete block design is known as " variety 
cutting" [255; see 113]. The resulting design may not always be the simplest one 
available for the particular values of b; v, and k. In order to obtain a relatively simple 
design, the experimental results in incomplete blocks 10, 12, and 14, which contained 
the following sets of treatments: (4, 5, 6, 9), (3, 4, 9, 10), and (3, 4, 7, 8), were arranged 

' into incomplete blocks containing the following sets of treatments: (4, 6, 8, 9), (3, 4, 5, 
9), and (3, 4, 7, 10) (table XIII-4). These shifts resulted in a p.b.i.b. design with 
m = 2 associale classes, which is similar to the simple lattice analysis (see example 
XL1). The notation used herein is somewhat different from that used in example 
XI-1 and by Rao (255]; it is more in agreement with that used by Bose and Nair 
[30], Kempthorne [175, Ch. 27], and Nair [221]. 

The parameters of the first kind for this particular p.b.i.b. design are 


v=9, b-9, r=4, k=4 
(XIII-33) 
№= 1, = 4, А = 2, п = 4 e 


p treatments which appear together once in the incomplete blocks are denoted 48 
an associates, and those which appear together twice in the incomplete blocks 4° 
second associates. The treatments occurring together once in one of the incompleté 


Blocks Bave the same type of association with other treatments. Hence, this desig” 
18 а two associate class design. 


и !All known p.b.i.b. designs for m — 2 ulated by 
Bose, Clatworthy, and Shrikhande [26]. ааны S&S 10 have been tab 
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TABLE XIII-4. Partially balanced incomplete block design constructed from the 
balanced incomplete block design in table XIII-1 


Incomplete 
Incomplete Treatment number block total 
block (perica) =B ü 


TO TO Vi fO fO fO fo по го 
У Ӧн лл к 
Эз оо 0 EDR 


36.4 32.4 34.1 21.7 20.5 16.6 


921; 48.15 8:51. 5719015 0 ШӘ 


0:15. 1.5210] 12259 2:0 0.8 


-T 
3 
8 
8 
T 
8 
2 
.0 


D 
в 
M 


The parameters of the second kind are 


Т2 22 
чп C ? SER C ) ми 


Тһе Pos’ matrix refers to the number of associates in common for pairs of treatments 
which are first associates; e.g., treatments 2 and 3 are first associates; they have one 
TSt associate in common, two second associates in common, and two of the treat- 
ments in common which are first associates of one and second associates of the other. 
he Pos” matrix refers to the number of various types of associates in common for a 
Pair of treatments which are second associates. The two kinds of parameters for a 
P.b.i.b, design fully determine the incomplete block design, although there is some lee- 
Way in the selection of treatments appearing together in an incomplete block. 
The analysis of the reduced design is described below. The incomplete block, the 
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treatment, and the grand totals are presented in table XIII-4. It should be noted that 
the ranges of panel scores by treatments are more alike than for the complete data in 
table XIII-1. The inclusion of the periods containing treatment 1 increased the range 
of panel scores. 

The next step is to construct table XIII-5 in order to complete the analysis of 
variance table (table XIII-6). The first column of table XIII-5 contains the treatment 
numbers; the second column contains the treatment. totals; the third column contains 
the sum of the incomplete block totals, B.u, which contain a given treatment; thus, 
treatment 2 appears in incomplete blocks 2, 3, 6, and 7, and (SB)sis equal to B.» + В.з 
+ Bi + Ba = 27.6 + 244 + 22.9 + 25.4 = 100.3. The values in the fourth column 
are obtained from the formula, 


0; = kX.; — (SB),, (XIII-35) 


which for j — 2 is equal to 4(38.0) — 100.3 = 51.7 = Qz. The first associates of the 
various treatments are listed in column 5. The sixth column contains the sum of the 
Q values for treatments which are first associates of treatment J; to illustrate, treat- 
ments 3, 4, 5, and 9 are first associates of treatment. 2, and the sum of the Q values for 
these т = 4 treatments is equal to 35.5 + 28.6 + 30.4 + (—34.8) = 59.7 = $` 0а 
= Qs + Qi + Qs + Qs. 

Before proceeding further with table XIII-5, it is necessary to calculate a number 
of constants associated with the design; thus: 


Ay —r(k—1) +2 =4(44-1) +2 = 14, (XIII-36) 
Án = (№ — Ара = (2 — 1)(2) = 2, (XIII-37) 
Ве = №- № = 1, (XIII-38) 
В = Ar + (А — M)(pu! — ри) = 14 + (1 — 2) = 13, (XIII-39) 
Аш = r(k — 1) + = 44 —1) 4 1 — 18, (XIII-40) 
An = (№ – Agpis = (1 — 2)(2) = —2, (XIII-41) 
Bu = №-№= –1, (XIII-42) 


d Ва = An + Вц(рз? — p) = 13 + (—1)(1 — 2) = 14, — (XIII-43) 
an 


A= А,В» = AssBis = Ay By = AuBy = 180. (XIII-44) 


; The seventh column contains the adjusted treatment effects without recovery of 
interblock information. These values, ĉ;, plus the over-all mean, 2 = 6.317, result in 
the adjusted treatment means. The adjusted treatment effects without recovery 
interblock information are obtained from the formula, 


0; вы 
4, 10а Bal _ BaQ; – Be Qn 
1 А -— А 
= (Bx + Вь)0; — В (0а + 0), (хш-45) 
А 


For example, #; = [13(51.7) — (1)(59.7)]/180 = 3.402. 
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After completing the computations in the first seven columns of table XIII-5, 
table XIII-6 is constructed. The various sums of squares are obtained below. 


Total (35 df): 
9.6? + +++ + 3.8? — 227.4?/36 = 1649.10 — 1436.41 = 212.69. 


TABLE XIII-6. Analysis of variance for the experiment in table XIII-4 


Source of variation 


ss 
1659.10 
1556.51 


Total uncorrected 


Correction for mean 
Total 


Treatment (ignoring block) 


Within treatments 21 


Block (eliminating treatment) 68 
Intrablock 


189.688 
14.88 


Treatment (eliminating block) 
Block (ignoring treatment) 
Intrablock 


Treatment ( ignoring block) (8 df): 


38.0? + +++ + 12.0? 272 = 1634.68 — 1436.41 = 198.27. 
4 3 


Block (ignoring treatment) (8 df): 4 
LB. _Х.з 276+ -+t +277 2214! 45129 — 1436.41 1488. 
TEM = 36 


i s i (XIII-46) 

Treatment (eliminating block) (8 df): T B (XIILAT) 
T 2 Bury. _ Ваз Оу. ї 
50/6 = 189.688 = FRX" — FAX 004 


Block (eliminating treatment) (8 df): 
£ #;Q;/k 
XB Х.? |202 Xi roe} c YLB.Sk — 3X. jr 250; 


be о RE (XIIL-48) 
= 1451.29 — 1634.68 + 189.688 = 6.298. 
Inlrablock (bk — v — b + 1 = 19 df): bof 
212.69 — 198.27 — 6.298 — 212.69 — 189.688 — 14.88 — 8.122. 
The estimated amount of intrablock information is ies 
ш = 1/E, = 1/0427 = 2.342, ( 
and the estimated amount of interblock information is 
bk — v E 20 = 1171. (ХШ-50) 
ш = ҚБ 1)Es - @— ЮЕ. 320787) — 5(.427) 
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In order to complete the calculations in table XIII-5 and to obtain the adjusted 
treatment means with recovery of interblock information, the following quantities 
are required: 


R = r[w + w'/(k — 1)] = 4(2.342 + 1171/3) = 10.929. (XIII-51) 


А = X(w — ш) = 1.171. (XIII-52) 

Аз = М(ш — ш) = 2(1.171) = 2.342. (XIII-53) 
Ai = R(k — 1) + А» = 3(10.929) + 2.342 = 35.129. (XIII-54) 
Az! = (Аз — A)piz? = (1.171)(2) = 2.342. (XIII-55) 
By! = Аз — Ay = 1.171. (XIII-56) 
В = А + Bur (pi! — pu?) = 33.958. (XIII-57) 
An! = R(k — 1) + A; = 33.958. (XIII-58) 
An’ = (Ai — Адра = —2.342. (XIII-59) 
By’ = Ay — Аз = —1.171. (XIII-60) 
Ba’ = Аш + Вц (рз? — pz?) = 35.129. (XIII-61) 

A’ = Ax! Boo! — Ax! Bi? = АВ — Ал By! (XIII-62) 


= 35.129(33.958) — 2.342(1.171) 
= 33.958(35.129) — (—2.342)( — 1.171) = 1190.168. 


The eighth column of table XIII-5 is computed using the formula, 
P; = wQ; + w'(SB); — гш, (XIII-63) 


which for j — 2 is equal to 2.342(51.7) + 1.171(100.3) — 4(4) (1.171) (227.4/36) 
= 121.0814 + 117.4513 — 118.3491 = 120.184. The ninth column contains the sum 
of the P; values for the treatments which are first associates of treatment j; thus, for 
j = 2 the УР = 93.719 + 66.903 + 76.974 — 85.561 = 152.035. In the tenth and 
last column the adjusted treatment means with recovery of interblock information 
are presented. The formula for obtaining the adjusted treatment means is 


1% Bue" 1 
af =#+ N =3+ gBP; x BYZ Pi), (XIII-64) 
LPi Bw 
which is similar to formula (XIII-45). The adjusted mean for treatment 2 is equal to 


1 
$3167 + =т01168183.958(120.184) — 1.171(152.035)] = 6.3167 + 3.2795 = 9.60. 
теша adjusted means are obtained in а similar manner. Р 
sons ds variances for differences of treatment effects derived from intrablock compart- 


Vi = 2kBnE./A = оооу = .2657 (хш-69) 
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and 
Va = 2kBoE./A = 2:03 42D) = 2467 (XIII-66) 
for pairs of treatments which are first and second associates, respectively. The average 
variance of a mean difference is equal to 
от Бозу _ mVi + nV: _ 4(.2657) + 4(.2467) 

ааа at $—1 2562. (XIII-67) 
The same form of the above formulae holds for the differences between adjusted treat- 
ment means when interblock information is recovered; thus: 


ЗАВ, _ 2(4)(35.129) _ ' 
go 7. 1190168 ^ 2501 (XIII-68) 
and 
_ 2kB»/ _ 2(4)(33.958) 
Vi RESP Cond ob ык (XIII-69) 


for pairs of treatments which are first and second associates, respectively. The average 
variance of a difference is equal to 
y'- : j' + п202) = 2322. QXIII-70) 
eem 
The recovery of interblock information did not appreciably alter the variance of а 
difference between two treatments. Since Æ» is not too much larger than E. this is 
what would be expected. 
The efficiency factor for this design is equal to 
Aia Bia 
(0 DlAm — Bm. (v — DA XIII-71) 
Eff = E Tt Вы = rk[(o — 1)B22 + mBiz] ( 
—n В 
_ 8080) 
= 404)[8(13) + 4(1)] 


o 
50 


XII-3 Incomplete Block Designs for the 
Two-Way Elimination of Heterogeneity 


Another incomplete block design is the incomplete latin square. For 
example, one or more rows or columns, or one or more rows and columns, of a 
k X k latin square may be omitted [321, 333]. The Youden square [28, 264, 
266, 333, 336, 337] and the semi-latin square [201, 284, 319], in some instances, 
Tepresent special cases of a latin square design with one or more rows omitted. 

lso, rows or columns may be added to a k X k latin square [60, 264], and 
Other situations are possible [28, 241, 242, 264, 269]. The examples discussed 
elow illustrate some of the incomplete block designs with a two-way elimina- 
tion of heterogeneity. 
he designs discussed in this section are of the single latin square type. 

© source of variability controlled is similar to that controlled by rows and 
Columns in the latin square. Designs such as the one-restrictional lattice in 
complete replicates are not considered as belonging to this group. The lattice 
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square design, a two-restrictional lattice, is of the latin square type within 
each replicate but not over the entire experimental area. 


XIII-3.4 ONE OR MORE MISSING ROWS OR COLUMNS IN A LATIN SQUARE 


In latin squares the yields for one or more rows, columns, or treatments 
may be omitted from the analysis. Also, the error variances of some treatments 
may be different from other treatments, and it may be desired to run the 
analysis on only a portion of the treatments in a k X k latin square. The 
omission of a treatment in a k X k latin square results in an incomplete 
latin square. The omission of one or more rows or columns in a k X k latin 
square results in an incomplete block design, since all treatments do not appear 
in the same row or column. Any omission results in some nonorthogonality ; 
despite this, some of the resulting incomplete latin squares are relatively 
simple computationally. For a complete discussion of these designs the reader 
is referred to the papers by Yates [321] and Yates and Hale [333]. Dr. G. S. 
Beavans' “chessboard” system of square-yard plots [284; Chapter I] with 
proper randomization fits into the scheme of analysis proposed by Yates and 
Hale. 


XIII-3.3 THE SEMI-LATIN SQUARE AND SIMILAR DESIGNS 


The semi-latin square design (also called a balanced or equalized block 
design) consists of a k X k latin square design with p split plots per whole 
plot. There are » = pk treatments compared in the k X k latin square. For 
example, with ten treatments arranged in a 5 X 5 latin square, Student [284] 
suggests the following arrangement for treatments A, B, C, D, E, Е, G, H, І, 
and J: 


H E с A 
Row 1 
D J B I 
J D F E 
Row 2 
G I A C 
I A G D 
Row 3 
B Cc H F 
Е 
Row 4 В | 1 
E G D H 
Row 5 А Е 1 B 
[e H E G 


Co.l Се?  CoL3  CoL4  CoL5 
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The order within a square is important; each treatment appears in the top 
and bottom rows an equal or nearly equal number of times. Аз Fisher states 
[319, p. 231], the correct error for any particular contrast “would be exceed- 
ingly laborious to estimate.” Yates [319] points out that the **usual method" 
of analysis for the semi-latin square leads to biased estimates of the error 
variance. The design has been used by a number of workers [113]. Ma and 
Harrington [201] report that 133 semi-latin squares (or modified latin squares) 
were used at the University of Saskatchewan and associated stations during 
the years 1936 to 1944. While recognizing the bias in the error variance for 
these designs, they estimated that the average error variance for comparable 
randomized complete block designs conducted over the same period was 
approximately equal to that for the semi-latin squares. 

Yates [324, sec. 16i and 16j] has proposed a modification of the semi-latin 
Square; he suggests arranging the 2k treatments in such a way that they cor- 
respond to the Latin and Greek letters in a graeco-latin square. He describes 
the arrangement and analysis for fourteen treatments arranged in a 7 X 1 
graeco-latin square. He also gives a limited discussion of designs of this type 
for 3k treatments in a Ё X k hyper-graeco-latin square. Yates suggests that 
ee designs may be suitable for treatment numbers between ten and twenty- 
our, 


ХПІ-3.3 YOUDEN SQUARES 


Youden [336, 337] introduced a set of incomplete latin squares which have 
become known as the Youden square design. This design combines features of 
the latin square and the balanced incomplete block designs. The Youden 
Square design was proposed to control the variability due to position or order 
Within an incomplete block. In studies on tobacco mosaic virus, Youden found 
that the position of a leaf on the plant affected the response. The number of 
leaves was insufficient to allow use of a k X k Jatin square. The design pro- 
Posed by Youden allows for a two-way elimination of heterogeneity, such as 
that due to plants and position of leaves on a plant. All treatments appear in 
any given leaf position (rows) and only a fraction of the treatments appear 
on any given plant (columns or incomplete blocks). For example, то = 7 
treatments were to be compared and if there were only k — 3 available leaf 


Positions, the appropriate Youden square design is 


Column or plant 
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If, on the other hand, k — 4 leaf positions were available to compare the 
v = 7 treatments, the appropriate Youden square design is! 


Column or plant 


In the first Youden square above, v = b = 7,k =r = 3, and à = 1, while in 
the second, v = b = 7, k = r = 4, and à = 2. Each treatment appears with 
every other treatment once within the incomplete block (column) in the first 
Youden square, whereas each treatment appears twice with every other 
treatment in the second design. 

Although the above method of constructing a Youden square from a 
specific latin square is suitable for some cases, other methods of construction 
are needed. Smith and Hartley [266] give a practical procedure for converting 
balanced incomplete block designs with b = v into Youden square designs. 
Also, Cochran and Cox [60, Ch: 13] and Youden [337] list experimental plans 
for Youden squares for various values of v. 

The randomization procedure for a Youden square design is 


(i) to randomly assign the treatments to the numbers, 
(ii) to randomly assign the columns to the incomplete blocks, and 
(Ш) to randomly assign the rows to the replicates. 


If the Youden square is replicated, repeat steps (ii) and (iii) using another 
randomization. A complete latin square design (or sets of latin squares) is 
preferable to repetitions of the Youden square design, but if, owing to the 
nature of the experimental material, it is impossible to use a complete latin 
square design, repetitions of the Youden square design are used. 

The analysis of variance for q repetitions of a Youden square is 


Source of variation df Mean square 


Dischi EATUS y 
LE Observed Expected А 
Row (replicate) qk —1 z= 


Column or block (elim. treat.) q(b — -4q(v-1 Ey o+ (Benet g 
а(у — 
Component (a) ( 
q-D(v-1) — a kep 
Component (b) v —1) — eit tk - vogt/(v =D 
Treatment (ignoring block) v—1 


Intrablock erro 


Тош x (v - D(gk- q— 1) Е, Uu 


qvk —1 = 


‘It may be noted th; , 
may be made of this fet ics. above two Youden squares form a 7 X 7 latin square. Use 


A settin i i ial. Separate 
and combined analyses are АТУ, g ap кышы on certain types of material. Sep 
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The row or replicate, the total, and the treatment (ignoring block) sums 
of squares are obtained in the usual manner. The block (eliminating treat- 
ment) sum of squares is obtained as the sum of the sums of squares for com- 
ponents (a) and (b). The interaction of groupings of k treatments with repe- 
titions of the basic plan yields a sum of squares with (q — 1)(b — 1) degrees 
of freedom and is computed in the same manner as for previous component 
(а) sums of squares. The component (b) sum of squares is equal to 


X We/akwo — ®(® — 1), (XIII-12) 
where det 
W; = (w — &)X.; — (о — 1)(SB); + (k — 0А... (XII-73) 


The above symbols are the ones used in the previous sections of this chapter. 
The estimated weights are equal to 


ius 17H] (XIII-74) 
and 
„= шу eae II-75) 
ир kq(v — 1), —(v— k)E, ae 
The weighting factor is equal to 
A "m а 4(Еь — Е.) - (XIII-76) 
ok — Dw F @— Ew gok — E+ (0 – kla – DE. б 


Тһе adjusted treatment mean with recovery of interblock information is 
*qual to the unadjusted mean plus an adjustment; thus: 


z; = £; + nW;/qk. (XIII-77) 


'Тһе standard error of a difference between two adjusted means is equal to 
e + @— Юн}. (XIIL78) 
The formula for a missing experimental unit is given by Cochran and Cox 
(60, р. 375]. Also, one may use the method for treating missing data as 
described by Rao [255] (a one is inserted for the missing observation and a 
zero for all other observations, and analysis of covariance 18 performed on 
the pseudo-variates composed of zeros and a one). 
= posed ot 2 à 
Youden [336, 337] and Cochran and Cox [60, Ch. 13] present the analysis 
for Youden squares without recovery of interblock information as well as 


with recovery. 


XI-34 OTHER DESIGNS 

Cochran and Cox [60, Ch. 13] present a series of designs for small numbers 
9f treatments which consist of a Youden square design added to a complete 
atin square design, A numerical example is also given. In connection with 
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these designs, it should be noted that it is simpler in some cases to use two or 
more latin squares than to use one of their designs. However, these designs 
are important and useful if only a fixed amount of material is available and 
this amount is less than required for sets of complete latin squares. 

Pearce [241, 242] developed a number of designs of the latin square type. 
He considered the analysis and layout for latin squares with an additional 
column, with an additional row and an additional column, and with a column 
added and a row omitted. Shrikhande [264] considers some general classes 
of designs for the two-way elimination of heterogeneity. He presents balanced 
incomplete block designs for b — mv where m is an integer. Shrikhande also 
considers the balanced incomplete blocks where b is not an integral multiple 
of v and shows that it is possible to obtain designs which have a two-way 
elimination of heterogeneity. For the latter designs, some pairs of treatments 
have one error variance while others have another error variance, whereas 
in a balanced incomplete block design with only one-way elimination of 
heterogeneity the error variance is the same for any pair. Bose and Kishen 
[28] and Shrikhande [264] developed a class of designs which they call partially 
balanced Youden squares. These designs are useful for larger treatment 
numbers than those generally suited to the Youden square design. 


XIII-4 Split Plot Designs for Nonfactorial Experiments 


For a large number of treatments, v, various experimental designs (see 
Chapters XI and XII and preceding sections of this chapter) may be utilized 
to control variability over the experimental area. If the experimental ma- 
terial falls into g natural groupings, it may be desirable to obtain more precise 
comparisons on the k individual items within each natural group than on 
comparisons between individual entries not in the same grouping. If only the 
contrasts on the Ё items within each natural grouping are desired, then each 
natural group may be tested in a separate experiment. If the comparison of 
individuals not in the same natural group is equal in importance to the com- 
parison of individual entries in the same group, then the gk = v entries аге 
compared in a single experiment (e.g., in an incomplete block design). How- 
ever, the actual situation is usually somewhere between these two extremes 
in that some information is desired on all comparisons but with greater pr 
eision on contrasts among individuals within a natural group. 

r pepe of material in natural groups are available from various fields- 
d S MEL ing, one might be interested in g crosses and k selections gee 
arent ntrasts among selections within crosses may be of more impo 
of the an contrasts among selections from different crosses. Also, the nature 
Thien a is de á advisable to group the material from each c 
Albo y exdssed with kia Y, one might be interested in the characteris 3 
TS of size r each. In a baking study, one might 28 
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to study characteristics on g recipes with Ё combinations of the ingredients 
of each recipe. Also, one might be interested in g teaching methods with Ё 
teachers using each method; a total of gk teachers participate in the experi- 
ment, and each teacher teaches r classes. These illustrations represent but a 
few of the examples and fields that contain material of this nature. Yates 
[322], Cochran [47], and others [165, 227, 255] have proposed designs suitable 
for these conditions. The construction and analysis of two of the designs are 
discussed below. 


XIIH-41 GROUPS OF SIMILAR TREATMENTS IN EACH WHOLE PLOT 


If g crosses with k selections per cross represent the v treatments, a “split 
plot design” with the g crosses randomly allotted to the g whole plots in each 
replicate and the k selections per cross randomly allotted to the k experimental 
units or “split plots” within each whole plot may be used. For r replicates 
the analysis of variance is of the following form: 


df Mean square 


Source of variation 
Replicate r-l — 
Crosses or whole plots —1 к=. 
IE Error (a) (т — D(g — 1) DW 

Selections within cross 1 k-1 — 
Selections within cross 2 k-1 = 
Selections within cross k—1 v 

E gk- 1)(т — 1) E» 


Lol, Eme) ЈР ҮШ SS DIT ENUME 
In the above analysis the interaction of the k selections with replicates for 
each cross may be computed; this interaction has (k — 1)(r — 1) degrees of 
freedom, If the individual interaction mean squares are all estimates of the 
Same quantity, say og’, they may be pooled into the single mean square E 
with g(k — 1)(r — 1) degrees of freedom. Also, the above split plot design 
may be analyzed as g separate randomized complete block experiments if de- 
Sired, This is possible despite the fact that the separate replicates for one cross 
аге intermingled with the corresponding replicates for the other crosses. 
The standard error of a difference between the means of two selections 


from the same cross is (ХПІ-79) 


v 2Ewr, 


9f two selections from different crosses is 


Aa. + @ — DED, (XIIL.80) 
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and of two cross means is 
V 2E. rk. (XIII-81) 

The above design has the advantages that the natural groupings are re- 
tained and that comparisons on the selections within a group or cross are more 
precise than without grouping. The disadvantage of this design is that less 
precision is available on comparisons of selections not in the same cross. 

Instead of having g randomized complete block designs intermingled in 
such a way that complete replicates are formed, g incomplete block designs 
could be used. That is, an incomplete block design suitable for k treatments 
in r replicates could be selected. The same design with different randomiza- 
tions would be used for the g crosses. Also, the g whole plots could be designed 
as a g X g latin square. Formulae (XIII-79) to (XIII-81) would still hold 
with the corresponding effective error mean squares replacing E, and Ey. 


XIII4.2 INCLUSION OF RANDOM CONTROLS IN THE SPLIT PLOTS 


With the idea of increasing the precision on selections from different 
crosses, controls have often been added to the k treatments comprising the 
split plot. The addition of c controls within each whole plot increases the size 
of the whole plot from k to k + c experimental units. The total additional 
experimental units required is grc. The use made of the additional experi- 
mental units is to make comparisons with the controls and to compare treat- 
ments in different randomized complete block designs through their common 
controls. If two treatment means, say 2; and i», not in the same group are 
compared through the controls in these two groups, say Z. and 2, the com- 
parison of (2, — 4) with (2, — Z.2) has the following standard error: 


Wise T on Sep grub sz t. (XIII-82) 


which is equal to 4//4s?/r if all means are subject to the same error variance 
and if all means have the same number of replications. 

In the event that more than one experimental unit per whole plot receives 
the check treatment, Yates [322] has shown that the optimum number of 
checks to include in the block for a given number of experimental units is 


с= -14 VATER, (XIII-83) 
where the greatest error variance of any comparison is used as the criterion 
for assessing the accuracy of an experiment. The integer nearest to the value 
of the above expression is used for the value of c. 
+ Yates [322] states that the use of random controls is less efficient tban 
d systematic spacing of controls in a randomized complete block experiment. 
^ e ae out that with smaller groups a latin square design may be used; both 
e and Cochran [47] state that there is not much to recommend these desig 
and the methods of comparison discussed in this section. They state that ш 
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design of the preceding section or some other incomplete block design is 
preferable to those discussed in the present section. 


XIII-5 . Linked Block Designs 


In a brief dissertation, Youden [338] and Eisenhart [101] discuss the de- 
sign and analysis for the partially balanced incomplete block designs desig- 
nated as Youden linked block designs. It is stated [338] that these designs 
possess "symmetry, ease of analysis, reasonable block size and a small number 
of replications.” Examples of the linked block designs are listed below. 


Bose and Shimamoto [32] discuss the relationship of linked block designs to 
other p.b.i.b. designs. Additional work on these designs has been reported by 
Youden and Conner [339] and by Mandel [206]. 


CHAPTER XIV 


Balanced Designs 


XIV-1 Introduction 


Experimental designs discussed thus far pertain, for the most part, to 
situations where the total number of experimental units is equal to the number 
of treatments times the number of replicates and where each experimental 
unit is used only once. Designs discussed in the present chapter pertain to 
situations in which the treatments are applied in sequence over several 
periods to a group of individual items and in which the number of experi- 
mental units may be less than the number of observations. Each period of 
observation represents a cycle, and several cycles are required to complete 
the requirements of balance in the designs. Since the aspect of balance through 
time or over cycles is important in the designs discussed in the present chapter, 
the term balanced designs is used to include all such designs. Particular 
balanced designs have been discussed under various names; €.g., cross-over 
[57, 117, 198], switch-back [273], switch-over [38, 273], reversal [38, 273], 
change-over [57, 60, 175, 198-200], double change-over [57, 175], double- 
reversal [38, 198, 235, 263, 273], double change-over with the last period 
repeated [157, 198], rotation [46, 77, 331], etc. [236, 325]. A brief discussion 
of some of the designs is presented in the following sections, and the similari- 
ties of the designs are noted. 

р, In certain types of experiments (e.g., rotation) the length of the period 
is fixed. In others, the length of the period of observation is determined by 
the experimenter. Care should be taken to insure that the period is of sufficient 
duration to allow expression of the treatment effect. Also, the period shoul 
be long enough so that the effect of the preceding treatment does not exte? 
beyond the next period, 
x prom effect that continues after the treatment has been шее 
E a carry-over or residual effect. The carry-over effect from 
t rin treatment, affects the measurement of the effect of the present 
icq ie carer ert may De then cae of by expel 
rest period may c iym às rest period" between the treatment periods. am 
or of Re ог m of a period of no observation on the present кем 
reatment. The lengths of the rest period and of 
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period of observation on the treatment need not be of the same duration 
and are determined by the nature of the experimental material and by the 
character measured. 


XIV-2 Change-Over Designs 


XIV-2.1 THE SIMPLE CHANGE-OVER DESIGN 


À design combining the features of the latin square and the randomized 
complete block designs has been used for comparing two to four treatments in 
Some dairy husbandry and other biological studies. This design may also be 
Used to advantage in psychological and marketing research. Like a latin 
Square the cross-over design has two restrictions imposed on the randomiza- 
tion of the “treatments” to the “plots” or individual units. The treatments 
are all included in each replicate or group. The individual units are rated 
with regard to time of application in each replicate or group. The second 
Testriction is that each treatment must be applied an equal number of times 
In each period or time in the replicates. ч 

In the simplest case, consider two treatments, А = supplemental feeding 
and B — no supplemental feeding, administered to six dairy cows. Each cow 
18 to receive treatments A and B in periods 1 and 2. The treatments A and B 
аге allotted to the periods at random with the restriction that half of the cows 
receive treatment А and the other half receive treatment B in period 1, and 
the Cows receiving treatment А (or B) in period 1 receive treatment B (or А) 
11 period 2. The experimental design for the six replicates (six cows) is of the 
following nature: 


Columns 
(cows or replicates) 


Rows rizuimi|i|v|evr 
Period] | B| B| A | ^| BY A 
Period 2 | A | A] B| B| A | B 


If the experiment is conducted as three 2 X 2 latin squares, the design is 
of the following nature: 
Square I 


Square II Square III 
Columns 
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where the six cows are grouped into three pairs. The difference between the 
cross-over design and sets of latin squares is illustrated by the following 
breakdown of degrees of freedom in the respective analyses of variance: 


Cross-over design Three 2 X 2 latin squares 

Source of variation df Source of variation 
Column or pair 5 Square 2 
Row (period 1 vs Columns in squares 3 

period 2) 1 Rows in squares 3 
Treatment 1 "Treatment 1 
Residual — error 4 Tr. X square = error 2 
Total 11 Total 11 


The main difference lies in the fact that there are more degrees of freedom 
associated with the error sum of squares in the cross-over design and that 
there is a less complete elimination of row effects. This design is suitable 
when row differences are approximately equal in all replicates, and, therefore, 
most of the row effects are removed by the single degree of freedom for rows, 
and the error mean square is no larger than for the latin square. Also, there 
are more degrees of freedom available for error. 

The cross-over design may be used for any number of treatments with the 
condition that the number of replicates must be a multiple of the number of 
treatments, but it is inadvisable to use a cross-over design in place of a latin 
square if there are more than four treatments, 

If the above design were applied to an experimental situation which re- 
quires a separate experimental unit for each treatment in each replicate, the 
analysis would be the same as that given above. For example, suppose that 
two treatments, A and B, are applied to dairy cows, that one treatment 
period is used, that twelve dairy cows are available for the experiment, that 
the twelve cows are paired into six pairs with each member of the pair being 
rated as “superior” or “inferior,” and that one-half of the superior and one- 
half of the inferior cows receive treatment A and the remaining cows receive 
treatment B. The experimental design might be of the following form: 


Rows 


Cows or replicates 


Superior 


Inferior 


Pearce [242] has denoted this type of design as a “ 


Каро уыт З tied latin е” to dis- 
tinguish it from the simple change-over design. In t| кл, 


he latter design, different 


| 
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treatments are applied to the same experimental unit in two or more time 
periods; i.e., there is a changing of the treatments on a unit. In the tied latin 
Square design, no changing of the treatments is involved, and there are as 
many experimental units as there are observations. 

For three treatments the simple change-over design is of the following 
form (other arrangements are possible): 


Columns or replicates 


Rows (period) 


Multiples of three replicates are required in order to have each treatment 
appear in each row an equal number of times. Ап alternative design for the 
above is to use two 3 X 3 latin squares. The breakdown of the degrees of 
freedom in the analysis of variance for the simple change-over design and for 
two 3 X 3 latin squares is 


Change-over design Two 3 X 3 latin squares "A 


Source of variation df Source of variation 
Replicate (column) 5 Square or set 1 
Row 2 Columns within square 4 
Treatment 2 Rows а а 4 
тор, 8 Treatment 2 
Treatment X square A 
Шош 17 Error within square 4 


analysis for the 3 X 3 latin squares may 
be pooled justifiably, 6 degrees of freedom are associated with the error for 
treatments, while 8 degrees of freedom are associated with the error mean 
Square in the change-over design. Thus, if the rows with two degrees of freedom 
control approximately the same amount of variation as the TOWS within squares 
with four degrees of freedom, the change-over design is more efficient than 
two 3 X 3 latin squares. Even if the error mean square in the change-over 
design is not appreciably smaller than the error mean square for the 3 X 3 
latin Squares, the extra degrees of freedom associated with the error mean 
Square in the simple change-over design make it the preferred design. : 
Simple change-over designs for four or more treatments may be set up in 
the manner described above for two and three treatments. The analysis fol- 


lows that outlined in example XIV-1. 


If the last two mean squares in the 
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Fieller [117] and Cochran and Cox [60, sec. 4.4] illustrate the method of 
analysis for a missing datum, and Fieller [117] describes the covariance 
analysis for the simple change-over design. 


Example XIV-1. Suppose that two merchandising practices, C and E, are com- 
pared in a store for a period of eight weeks. Furthermore, suppose that the display 
counter is such that the customers approach from one end of the counter. With regard 
to the product and to the practices used in this experiment, it is suspected that cus- 
tomers will tend to select items from the practice that is observed first. Therefore, the 
display counter is divided into two parts, first and second positions, and the design 
is such that both practices appear in the store in every week and each practice appears 
in the first position during four of the eight weeks and in the second position during 
the remaining four weeks. The procedure is to allot the practices randomly to the 
weeks with the restriction that treatments C and E must each appear in the first 
position in four of the weeks. The arrangement, the total sales in pounds per week 
(artificial), and the various totals are given in table XIV-1. The total, replicate (week), 
row (position), treatment, and residual sums of Squares are computed as follows: 


TABLE XIV-1. Total sales (pounds/week) for two merchandising practices 


Meek or replicate nunber 


[айша m Da Las RET aC i 


E-71 | С-66 | E-h9 | c-59 | E-50 | Е-50 | c-h8 C-59 
С-50 | Е-%ә | C-lh9 | E-50 | c-ho | c-43 | Е-з1 | g-48 


Position 


Total for C = 423; total for E = 391. 


Analysis of variance 


Source of variation 


yariation af вв ms 

Replicate T 552.75 78.96 
Row (position) 1 506.25 506.25 
Treatment 1 64.00 64.00 
Residual 6 208.75 34.79 


Correction term (1 df) = 8142/16 = 41412.25. 
Total (15 df) = 712 + 50? + +++ + 48? — 41412.25 = 1331.75. 


Replicate (7 df) = (121? + -.- + 1072)/2 — 41412.25 = 552.15. 
Treatment (1 df) = (423 — 391)2/16 = 64.00. 
Row (1 df) = (452 — 362)2/16 = 506.25. 


Residual (6 df) = 1331.75 — (552.75 + 64.00 + 506.25) = 208.75. 


be dn Standard error of а difference between two treatment means is 4/2(34.79)/ p 
= 2.95. The 95 per cent confidence interval on the difference in pounds sold per #6 


from the two merchandising practices is 423 — 391 F De (2.95) (2.447) = —3.2 to 4.0 + a 
= 11.2 pounds per week. The coefficient of variation is equal to 100(16)</34.79/8! 


1 
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= 12 per cent. The error variance, 34.79, is much smaller than it would have been 
without the stratification into rows. The single degree of freedom for rows is associated 
with a relatively large amount of the total sum of squares. The efficiency of the design 
relative to the comparable randomized complete block design is 
506.25 + 34.79(6 + 1) /6+1\ (8-3 
8(31.79) GE г) fe ms т) X100 755296 per ceni; 
de DOUBLE-REVERSAL DESIGNS WITH EXTENSIONS —NO RESIDUAL 
FECT FROM PREVIOUS TREATMENT 

Brandt [38] describes the analysis for a group of designs known as the 
S ouble-reversal, switch-back, or double change-over designs. Basically the 

esign involves two treatments and two sequences of treatments over the three 
periods of observation; thus: 


oS 00) m n —— 
Period Sequence of treatments 
MANT ria сз, 


I А B 
п B А 
ш А B 


If four periods are used, the sequences are 


Period Sequence of treatments 
I EY B 
п в А 
ш А B 
IV B A 


ee eee = 


ore periods. However, it is unlikely 


The above system may be used for five or m 
d of more than three or four experi- 


that the total usable period will be compose! 


Over three time periods. (Presumab 
oe to the character measured in t 
^ hen, the five subjects are randomly allo З 
reatments, The allotment of treatments 1s 


Peri ABA Sequence ВА! 
ps 1 err ien t Ge ЫТ. вао А 10050) 
—— ЭҢ} К 
I Au Ал As Aa As Ba Ba Ba Bo Ва 
1 ваны ОВО e Аа A Аара реон 
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where the symbol with the subscript represents the yield, the first subscript 
refers to the subject, and the second subscript refers to the period. The dif- 
ference between the two treatments is estimated to be 


5 10 
Abe = 2Ва + Аа) — LBa — 2А. + 24]. (XIV-1) 


with a variance equal to 


(2 (er) ber ОВО Дз = bee ВЕСЕ As) vs 


10 10 
+ > (Ba = 2А + Ва)? — 52 — 245 + Ba s]. (XIV-2) 
The above variance has 2(5 — 1) — 8 degrees of freedom. Both Brandt [38] 
and Snedecor [273, sec. 15.8] give illustrative examples of the above. 


Seath [263] devised a double-reversal design for a 2 X 2 factorial set of 
treatments; thus: 


Sequence 


Period 


I 
II 
III 


The first subscript refers to the level of the first factor and the second sub- 
script to the level of the second factor. The quadratic term (formula (XIV-2)) 
is used to test main effects; a larger error (interaction of groups of sequences 
and levels of AB) is used to test the AB interaction. This design overcomes, to 
some extent, the difficulty of comparing only two treatments and still retains 
the high precision usually associated with the designs [198]. 


XIV-2.3 CHANGE-OVER DESIGNS TO MEASURE RESIDUAL EFFECTS 


dn experiments where the treatments are applied in sequence to the same 
animal, the same store, the same field plot, etc., the effect of certain treatments 
continues after the application of the treatment is discontinued. The effect 
pua given treatment is influenced by the carry-over or residual effect of the 
previous treatment, One method of eliminating the effect of the previous 
тын 1s to insert a rest period between the treatment periods. In this way 
i пан effects are freed of most of the residual effects. However, it 18 
SF pe possible or desirable to use such a procedure. The alternative is (0 

direct eae design which yields a measurement of both residual an 
ct eltects of treatments and an adjustment of the direct effects for residu? 


$ XIV-2.3] Change-Over Designs to Measure Residual Effects 445 


effects, and vice versa. Cochran el al. [57] discuss the design and analysis foran 
experimental design which meets these requirements. Their results are given 
below for three treatments and for four treatments. 

For three treatments (А, B, C) the following design is used: 


Sequence number 


In square 1, treatment B follows treatment А on items ] and 3. In square 2, 
treatment A follows treatment B on items 5 and 6. The same balance is 
achieved for other combinations of treatments in the two orthogonal latin 
Squares. (The particular randomization of the last design in section XIV-2.1 
contains the six sequences of treatments given above. In that design, no atten- 
tion is paid to the groups or squares.) The six items should be grouped into 
two groups in such a way that the groups are as different as possible and that 
the items within groups are as homogeneous as possible. Then, a square is 
randomly allotted to the groups, and the items within a group are randomly 
allotted to the three sequences of treatments. 

1 With four treatments the following three 4 X 4 orthogonal latin squares 
yield a double change-over design for measuring residual effects: 


Sequence number 


Period 


mportant features of the de- 
quences. Lucas [198] points 
Iso satisfies the requirement of symmetry. 
of treatments is the one discussed 
for three treatments. 

etc.) the k — 1 orthogonal! 
domization procedure 


Thus, twelve items are sufficient to obtain the i 
Sign, although there are twenty-four possible se 
out that a single 4 X 4 latin square а 
E above design with twelve sequences 
elow. The randomization procedure follows that 
For other numbers of treatments (e. 5, 7, 
Squares of a k X k latin square are selected. The ran 


"These are the orthogonal squares defined in Chapter XV and not necessarily those 


described by Bose and Nair [31]. 
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follows that described above for three and four treatments. Also, residual 

effects may sometimes be estimated from less than k — 1 orthogonal k X k 

latin squares, but the effective number of replicates may be too small for the 

purposes of the experiment. Even with Ё — 1 orthogonal latin squares the 

ratio of the effective number of replicates of residual effects relative to direct 

effects is 2 to 3 for three treatments and 3 to 4 for four treatments [198]. 
The advantages of the double change-over design are: 


(i) The design allows estimation of residual effecis as well as direct effects of 
the treatments. 


(ii)A high degree of precision is usually attained with this design [198]. 
- (iii) The design is suitable for small numbers of treatments. 


The disadvantages of the double change-over design are: 


(i) The effective number of replicates for residual effects is less than for direct 
effects. 


(ii) The design is usually limited to three or four treatments. 


Lucas [198] states that carry-over effects may be estimated from certain 
Youden square and lattice square designs but that the large number of items 
(animals, stores, etc.) required is too large for most experimental setups. 

The sums of squares for group, item within group, period, period X group, 
and for the total are computed in the same manner as for groups of latin 
squares (see Chapter VI). The new features in the analysis for the double 
change-over design are indicated in the following analysis of variance table: 


SE EEE eee. 


Source of variation 2 ре 24 Ў wet ~~ 
Total corrected for the mean 18q – 1 484 — 1 
Group 24 — 1 а Pu 
Period 2 З 
Item within group 4q 9а 
Period X group 2(2q — 1) 3(3q — 1) 
Treatment (direct and residual) 4 6 
Direct (ignoring residual) 2 3 
Residual (eliminating direct) 2 3 
Direct (eliminating residual) 2 B 
Residual (ignoring direct) 2 3 
or 


The sums of squares for the direct and residual effects of treatments аге 
obtained from the following formulae. (The residual or carry-over effects TN 
partially confounded with items. The sum of squares for residual effects is 
computed euminating the effect of items, but the sum of squares for items 4 
computed ignoring the residual effects.) 
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For three treatments: 
(i) Direct effect of treatments ignoring residual effects: 
3 
Б Х-һ^/6 — X....2/18q. (XIV-3) 


(ii) Direct effect of treatments eliminating residual effects: 


[5X..4. + За — b — c + s: + зв — 2X..../3Y 
1209 


ae [5Х..в. + 2b—a-—c-4d-sd- 5 — 2X-/9]5 
120g 
[5X..c. + 2c C a — b + sı +ss— BX fot Ж. [4X....]*. XIV-4 
Y 120q 3600 EM 


(ii) Residual effect of treatments ignoring direct effects: 
[За + se + s6]? + [3b + з + s4]? + [Be + зз + so]? 
30q 
Bab à tX] Qu.) 
90g 
(iv) Residual effect of treatments eliminating direct effects: 
[8а + X.a. + so + se? + [3b + Х..в. + so + s]? + [3e + X.-c. + 1 + ss}? 
240 
_ [3(a -- b +o) + 2Х....]® (XIV-6) 
727 
For four treatments: 
(i) Direct effect of treatments ignoring residual effects: 


4 
3: X..4.2/12q — Х....2/48. 
hel 


(XIV-7) 


(ii) Direct effect of treatments eliminating residual effects: 
[IX.4.-E3a —b —e — d-E sb sic su — Ж. /ФЁ |... 
1320g 


4, Do. +3 Se Soe ee + se + ви — X..../2] 


13204 
— [10X....]2/49(1320). (XIV-8) 


(iii) Residual effect of treatments ignoring direct effects: 
2 
Ма + 5 E sod su] c ee [4d + si + ss + su] 
132q 
[Aa be D E XJ (XIV-9) 
5280 


(iv) Residual effect of treatments eliminating direct effects: 
[da + X.a. 4 set ar н]? + co + [Ad + Хор. + а + c sul? 
1209 
— [A(a + b + c + d) + 2X-...]/480q. 
The error sum of squares is obtained by subtraction. 


(XIV-10) 
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The linear model for the gth repetition of the double change-over design is 
Ху = u + os + Big + (@B) sip + 5, + Pp + eps — (XIV-11) 


where » = mean effect, o; = effect of ith period, 8;, = effect of jth sequence 
in gth repetition, (a8),;, = effect of the Jth sequence in the gth repetition ir 
the ith period, 5, = direct effect of the treatment, p, = residual effect of the 
preceding treatment, and *ijsg = random component of error; g = 1, 2, •**, 
qii = 1,2,...,Ь;] =1,2, sk — 1);h = 1,2, e,k;andp1,2,::, 
k. In the sums of squares in formulae (XIV-3) to (XIV-10), Х.... = grand 
total of all observations; X..,. = treatment totals;h = A = 1, B = 2,С = 3, 
D = 4; q = number of repetitions of the basic design; 5; = sequence total; 
J = 1,2, ---, 6 for three treatments and j = 1, 2, ---, 12 for four treatments; 
a = sum of yields of treatments which received treatment A in the preceding 
period; b = sum of yields of treatments which received treatment B in the 
preceding period; c — sum of yields of treatments which received treatment C 
in the preceding period; and d — sum of yields of treatments which received 
treatment D in the preceding period. 

For three treatments the treatment. means adjusted for residual effects 
are! 

(5X..4.+ 2a — b — c + $2 + 5 — 2Х..../3]/24а, (XIV-12) 
[SX...--2b Ca сф +s- 2X..../3]/24q, (ХІҮ-13) 

and [5Х..о. --2c — a — b + sı + ss — 2X.../3]/24g; (XIV-14) 
for four treatments the adjusted means are 

[11X..4. T3a—-b—-c-— d 4- “Hs + $19 — X..../2]/120q, (XIV-15) 

[11X..5.--3b a-c- d+ sHs ts — X.../2]/120g,  (XIV-16) 

[ILX..c.-F3c -a - b d+ s ss — X.../2]/120g, — (XIV-17) 
and[ILX..5.--3d — a - b — c+ sı + ss + su — X.../2)/120g. (ХІУ-18) 


For three treatments the formulae for the residual effects adjusted for 
direct effects of treatments are 


[За + X..,. + s2 + s] /80, (XIV-19) 

[35 + X... + ss + s] /8g, (XIV-20) 

and [3c + X..c. + sı + 55] /89; (XIV-20) 
for four treatments the adjusted residual effects are 

[4a 4- AC Huts 5$19]/30q, (XIV-22) 

[4b + Х..в. + зз + ss + 89]/30g, QI 

[4c + X..5. +, 4 Ss + 51:]/30q, (ХТУ-20) 

апа [4d + X... + s1 + ss + 7300. (ктү) 


"Note that, sequences 2 
followed by treatme 


à is 
and 6 contain treatment A in period III, that treatment B 


nt C in sequence 2 i by treatment 
їп sequence 6. Similar Ene » and that treatment C is followed by ә 


Н " airs 0! 
sequences, 3 and 4 and 1 arg Spe among the treatments are found in the other Р 
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Alternatively, the treatment mean adjusted for residual effects may be 
obtained by subtracting the mean residual deviations for the other treatments 
from the unadjusted treatment mean. For example, the treatment mean ad- 
justed for residual effects for A is equal to 


v3 Е, | formula (XIV-20) + formula (XIV-21) — rr | 


(XIV-26) 


for a design with three treatments and is equal to 


X.a. у 12a+b +0) + 6X.... 
йор [ evo» + (XIV-24) + (XIV-25) — c iia 


(XIV-27) 


for a design with four treatments. The other adjusted treatment values may 
be computed in the same manner; the adjusted treatment means will usually 
be computed from formulae (XIV-19) to (XIV-21) or from formulae (XIV-22) 
to (XIV-25). 
The standard error for an adjusted treatment mean for three treatments is 


1.25 error mean square (XIV-28) 
6q А 


апа for four treatments is 


Eres men апав ЖАГ 
120 


The design and analysis for a double change-over design with three ae 
ments are illustrated in the following example. The example is particularly 
Useful in illustrating the computation of the quantities а, b, and c. 


Ezam, - 157] conducted an experiment in retail grocery 
Stores in je AE. os pueri ОК effect on McIntosh apple sales (in Шош) 
due to Packaging the apples in 4-pound (treatment A), 6-pound (treatment. ; ), а 
8-pound (treatment C) Polythene bags (table XIV-2). Six retail grocery f ores b 
Central New York were used. Each treatment, was left in a store for one Pus а үт 
assumed that the effect of an apple purchase in one week did д exten d us 
Succeeding week. With bags of the size used in this experiment a e XE. i: y E 
Tegarded as valid. For larger bags of apples a longer period of observ: 


tr F 
eatment is recommended. with group I (stores 1, 2, 3) contain- 


Я аа Oups, 
The six stores were divided into two gr n 6) containing the largest stores. For 


Ing the up II (stores 4, 5, 1 
the mo 27 ае: store doing spp emery Teras Prr 
of business аз stores 4 and 5 should have been used in place of store 6. This * йн 
always possible, as it is usually difficult to enlist the aid of store ae an д 
experimenter must use the stores whose managers will теа t 4 d 
he store numbers were assigned at random to the stores wit ud group, 
ach group of stores was randomly assigned to the particular 3 X 3 latin square. 


TABLE XIV-2. Pounds of McIntosh apples sold per week in 4(A), 6(B), 8(C) pound 
Polythene bags in six stores 
1 A-720 В-1000 С-650 2580 

II B-900 C- 60 A-720 ako | с-1560 A-1560 B-1120 Loko 

III C-T20 A- 20 В-800 2440 | в-2090 C-178} A- 900 4774 


Total 2340 2760 2160 7260 5290 542} 2980 13694 20954 
Set total = 8) = 52 = 25 = 5) = в, = 56 


Data for adjusting yields 


Sun of yields 
Treatment of treatments in ET 2/ 2 Eram 
previous period = 


а = 504 £5 + 8g = 5700 
b = 1000 85 + 5, = 7850 


с = 4850 5, + 85 = 7764 


Store and treatment (pounds of apples) 
1 2 5 Total 


у 5 6 Total 
A-1640 B-2080 C- 960 4680 


Unadjusted Adjusted 
Mean Increase over A Mean Increase over А 
Treatment lbs lbs. per cent lb: lbs. per cent 


B 
c 


A 
B 
c 


Source of variation ре 


Group 

Period 

Period x group 

Stores within group 

Direct (ign. carry-over) 

Carry-over (elim. direct) 
Direct (elim. carry-over) 


Cerry-over (ign. direct) 
Error 


Deviations from mean 
in pounds per week 


-66 
-51 
117 


= 27332/24 = 1139 
27460/24 = 114} 
28798/24 = 1200 


+ 
х 
& 
Ӯ. 
o 
„ 
+ 
а 
Ж 
E» 
> 
П 


4 6804 
Total 
cen 
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In performing the analysis of a double change-over design [57] the first step is to 
obtain the store totals (з) over the three periods, the period totals both within and 
over groups, the group totals, and the grand total (table XIV-2). The second step is 
to obtain the treatment totals over all stores, the sum of the yields in the previous 
period, and the sum of the set totals. The sum of the yields receiving that particular 
treatment in the previous period for treatment A is equal to 


a = 900 + 800 + 1560 + 1784 = 5044, 


for treatment B is equal to 
b = 720 + 820 + 1560 + 900 = 4000, 


and for treatment C is equal to 
в = 920 + 720 + 2090 + 1120 = 4850. 


mean in pounds of apples per week is merely the un- 


The unadjusted treatment 
The adjusted means in pounds 


adjusted total divided by 6, the number of stores. 
of apples per week are 


For treatment A (from formula (XIV-42)): 


ža = зйг[5(6460) + 2(5044) — 4000 — 4850 + 2760 + 2980 — 13969} 
= 25309/24 = 1055. 


For treatment B (from formula (XIV-13)): 
£p = ъ\{5(8010) + 2(4000) — 5044 — 4850 + 2160 + 5290 — 13969} 
= 31637/24 = 1318. 


For treatment C. (from formula (XIV-14)): 
то = wir {5(6484) + 2(4850) — 5044 — 4000 + 2340 + 5424 — Eus 


= 26871/24 = 1120. 

The estimates of the carry-over effects per treatment per store in total pounds 
of apples sold in one week are given in table XIV-2 (formulae (XIV-19) to (XIV-21)). 
The method of computation is indicated in the table. For example, the carry-over effect 
for treatment A is [3(5044) + 6460 + 2760 + 2080]/24 = 27332/24 = 1139. The 
difference in carry-over effect between treatments B and A is 3432 — 3417 = —51 
— (—66) = 15 pounds of apples per week, between C and A is 3600 — 3417 — 117 
— (—66) — 183 pounds, and between C and B is 3600 — m - P рше 

Corresponding di the direct effects аге 1318 — 1055 = 263 pounds for 
ponding differences for the ЖОШ уз С-да 


and А, 1120 — 1055 = 65 pounds for C and А, and 198 
for C and В. The carry-over effect from treatment C to treatments А and B is large; 


i.e., 183 and 168 pounds, respectively. The carry-over effect from treatment B to A is 
relatively small. From a practical standpoint, this means that a purchase of 4 or 6 
Pounds of apples in one week did not affect the purchases of apples in the succeeding 
Week. However, the purchase of 8-pound bags of. apples in a given week did affect 
the purchases of apples in the next week. Evidently, 8 pounds of apples are more 
than enough for one week’s supply for the average household buying apples in Central 
New York. Also, the total sales of apples were increased by using the 6-pound bag, 
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Adjusted mean for А = 1077 — (—17 + 39) = 1055. 
d PARSER 27135525 (—22 + 39) = 1318. 
id “ * С = 1081 (—22 — 17) = 1120. 


(formulae (XIV-3) to (XIV-6)): 
Direct effects of treatments ignoring residual effects (2 df): 


2 2 2 2 
A HO ва quse 


Direct effects of treatments eliminating residual effects (2 df): 
25309? + 316372 26871? 83817? 
== йй — “зр = 181106. 
Residual effects ignoring direct effects (2 df): 
20872? + 19450? + 22314? 62636? 
30 7 790 
Residual effecls eliminating direct effects (2 df): 
27332? + 27460? + 287982 _ 83590? 
24 72 
As a partial check the sum of the first and last sums of squares above should equal 
the sum of the second and third sums of Squares given above; thus: 
and 262875 + 54941 = 317816, 
181106 + 136710 = 317816. 
The error sum of Squares is obtained by subtraction; thus, 4001172 — (2299798 
+ 25182 + 29795 + 1321777 + 317816) = 6804, ; 
The standard error of a treatment mean adjusted for residual effects is 
V 1209H./24q = V/L25E,/6g = /1.25(1701)/6 = 18.8 pounds of apples per week. 


For three treatments the hsd = 18.8(g05,.5,44/) = 18.8(5.00) = 94.01, Thus, the dif- 


= 136710. 


= 54941. 


nces of direct effects are 
2.5 — 2..4 t hsd — 263 + 94.0, 
Y.p— $26 + hsd = 198 + 94.0, 
Ze — 2.4 + hsd = 65 + 94.0, 

1The value of 90447 Was obtained from May's table of ranges [208].. 


and 
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XIV-2.4 OTHER DESIGNS 

Some additional designs exhibiting symmetrical properties are discussed in 
the present section. Other designs are possible and will arise with new experi- 
mental situations [236]. 

In the ordinary double change-over design with three treatments the repli- 
cation on direct effects relative to residual effects is in the ratio of 3 to 2 
[198]. Also, the carry-over or residual effects are partially confounded with 
items, The partial confounding makes the effective replication less than actual. 
Lucas [1 98] further points out that the direct and residual effects are positively 
Correlated and that the interest of the experimenter may be in the direct 
effect plus the residual effect [also, see 200, 235]. The latter quantity estimates 
the effect of the treatment in the second period if the given treatment had been 
applied for two consecutive periods instead of for one period. Lucas states 
that this quantity may “е of more practical importance than either the direct 
or the carry-over effects per se" and that the variance of the “direct plus 
residual effect is disconcertingly high” in the double change-over design. By 
the introduction of a single new feature to the double change-over design, 
Lucas found that the covariance between direct and residual effects can be 
eliminated and that the variances of direct and residual effects can be made 
More nearly equal. The additional feature consists of doubling the last period 
8 each square of the double change-over design. The design presented by 

ucas is 


Square 1 Square 2 


Period 


ST 
II 
III 
IV 


The doubled period is considered as two periods in the analysis. 
А variant of the preceding design is the following: 


Sequence number 


The above design requires twelve like items and four periods of observation 
for the three treatments, For four treatments, seventy-two like items are re- 
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quired to obtain the same type of symmetry described above. The eighteen 
sequences with treatment, A applied in period I are 


Sequence number 


Period 8 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 
I А A|A|A|A|A|A A 
II B D C|B|D|D B 
ш B B|C|ce|c|clnp D 
IV D B|B|D|C|C|B D 
у Cc C|DIB|D|B|C C 


The other fifty-four sequences may be obtained in the same manner. The above 
Sequences do not contain consecutive applications of a treatment in the first 
and second periods or in the last and next to the last periods. If this were 
desired, additional sequences would need to be set up. Since a relatively large 
number of homogeneous items is required, the design does not appear to be 
suitable for most experimental situations. Also, the analysis for the above 
design has not been worked out. 

The following variant of a double change-over design was set up to com- 
pare three marketing procedures in six stores over seven periods of one week 
each: . 


Store number 
Group 1 | Group 2 

Period 4 5 6 
I A B С 
п с А B 
ш B c А 
IV A B C 
V B C A 
VI С А B 
VII A B Cc 


It will be noted that the first three periods form a double change-over design. 
This is also true for the last three periods and for periods IV, V, and VI 
Residual effects enter into the fourth and fifth periods but not in the first 
period. If only three items and four, Seven, or ten periods are available, ап 
analysis is still possible, The design is useful for situations in which several 
periods of observation are available. For example, the market demand for à 


ete. weeks, and the period of observation might be one week. 


те ee 
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of observation has been developed by M. L. Richards.! The various sources 
of variation and sums of squares in the analysis of variance are given below. 


Source of variation df 8s 
Total (corrected for the mean) 4l T 
Store (ignoring direct) 5 5 
Period 6 P 
Direct (ignor. res.; elim. store) 2 р 
Residual (eliminating direct) 2 R 
Direct (elim. residual and store) 2 D' 
к Residual (ignor. direct; elim. period) 2 R 
Error 26 by subtraction 
In the above analysis 
Kr 
[y S^ Yos Зд (XIV-30) 
fel 721 
6 
vnu or ЖЫЗ 
(gus Dh irs. = AERA (XIV-31) 
à T 42" 
7 
Кыл NS 
ipis S Ba Als (XIV-32) 
>; 6 42 ' 
KERM — X,..— X4..—2X...— Kat P+ [ТЖ.в— Хи. Ак 2А. SE 
gr 672 
4 UX. .c~ Xs..— X..—2X...— Xc*]* 
672 
LOGS (Qc) G7] (XIV-33) 
672 3 
R П6(За + Х.л. Х...=а*) +3 XAF — 
21600 A * Xo*}2 
(16(36-4+.X.1.—X...=b*) +3Xp*F+[16Be+X1. =X... =e) 3с) 
ài 21600 
[1625-3 X,4*?--[160*--3X5*]*-- [16c* 3X E (XIV-34) 
21600 
ри lOXa*+ Ta P+ [6X o*+ TH+ [6c + TOF (XIV-35) 
18900 
R- CE (b*y-- (c** (XIV-36) 


108 
Xi.. — store total, X.;. = period total, X.., = treatment total for = A, B, 


апа C, X... = grand total of all observations, а = sum of yields for treatments 
us period, b = sum of yields for treatments receiving ` 


In previous period, с = sum of yields for treatments receiving C in previous 
Period, and the starred quantities are defined in equations (XIV-33) and 
1North Carolina State College, written correspondence. 


receiving A in the previo 
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(XIV-34). In addition to the above sums of squares the sum of squares for 
“permanent” effects (direct plus the residual effects) is 


[9Х„* + 23a*] + ХЕ дор aL [9Xe* ae 29] (XIV-37) 


with 2 degrees of freedom. As an additional feature in the analysis the treat- 
ment by period interaction could be computed to check the consistency of the 
treatment responses in the experiment. This is done by comparing the treat- 
ment totals in the first three periods and in the second three periods. The 
resulting interaction sum of squares has 2 degrees of freedom. The above 
sums of squares are a little out of the ordinary. This is occasioned by the 
fact that there is partial confounding between direct and residual effects, be- 
tween store and direct effects, and between period and residual effects. 

The above designs indicate the diversity of types of designs possible for 
various situations. The analysis of other designs may be developed along the 
lines described for the above design [see 236, 242]. 


XIV-3 Designs For Long-Term Experiments 


Certain agronomic experiments are conducted on the same experimental 
site for a number of years. The design, the analysis, and the interpretation of 
such experiments are not in an advanced stage. Much work still needs to be 
done on the above three phases of long-term experiments. However, enough 
work has been done on such experiments to clarify the procedures for certain 
types of experiments. Some of these are discussed below. Papers by Cochran 
[46], Crowther and Cochran [77], Dutton [99], Fisher [123, 124], and Yates 
[331] represent worthy contributions to the complex subject of long-term 
experimentation. Other papers have been written on this subject, but the 
ones listed above contain most of the pertinent information on such experi- 
ments. The comprehensive paper by Cochran is particularly enlightening in 
that he treats the general subject and presents several illustrative examples: 
the paper by Yates clearly defines the various effects of treatments and the 
general principles of long-term experimentation. 

In the preceding chapters, only the direct effects of the treatments are 
considered, while both direct and residual effects of treatments are discusse 
thus far in this chapter. Long-term experimentation results in addition? 
ERR ы, viz. cumulative effects and limiling value of cumulative 
[331] as uS The various treatment effects are defined by E 
ce ES re pertain to the application of a single ai, 4 

"The direct Hea LY to consider differences between treatment i 
tA i ofa treatment may be defined as the response to the 

e year of application, or more exactly the difference 1D 


yield where the treatment i i ined if thé 
s applied i ned 1 
treatment were omitted, ppüed over that which would be obtai 
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“The first-year, second-year, etc. residual effects may be defined as the 
differences in yield in the first, second, etc. years after application, when a 
treatment is applied in a particular year, over the yield if the treatment were 
omitted in that year. 

“The cumulative effect may be defined as the difference in yield in a par- 
ticular year resulting from repeated application of a treatment or cycle of 
treatments, over the yield that would result if the treatment were not 
applied.” 

The limiting value of the cumulative effect may be defined as the stable 
value to which the cumulative effect tends if the experiment is carried on for 
a very long time. i 

Yates further states that none of these definitions is absolute, since the 
response to a fertilizer treatment may be different on two different types of 
soils or on two areas which had different treatment previoùs to the time of 
application of the fertilizer. However, they are workable definitions in agri- 
cultural experimentation. In other areas of experimentation, it may be nec- 
essary to develop new concepts and new definitions. 

Various types of long-term experiments [46] are presented in figure XIV-1. 
In this breakdown, Cochran considers the treatments (cultural methods, 
fertilizers, varieties, time of harvesting, etc.) as different from the various 
Crops (corn, oats, wheat, sugarbeets, swedes, cotton, etc.). Thus far in the 
Present manuscript the term "'treatments" refers to the items being tested, 
but in this section the term is used in the restricted sense described above. 


The various crops are not considered as treatments. 


SN nd ЕЕЕ eee 


Information 
Treatments supplied on Crops 
i 1 
n Applied on Every year Cumulative effects Single crop айы | 
Fixed {е same Е E only Residual effects : {perennial 
plots At fixed intervals Direct and resid- | уе Fixed rotation 
ual effects 
i i i id- Effects of different 
Rotating J ele on different plots Dust and resid ped 
in successive years ual effec 


Figure XIV-1. Types of long-term experiments. 


e only evidence on certain biological 


Long-t i tion affords th : ; 
тераа experimentation. Since 


phenomena and, hence, is a very important type of d 
Such experiments are usually expensive, the number of treatments and crops 


tested and the size of the experiment are regulated by available resources. 
Such conditions make it mandatory to use good designs, to use efficient 
analyses, and to make a complete interpretation of the data. The data from 
Such experiments should be studied at the end of each year, and the results 
Should be combined after a few years [46]. Useful knowledge will be provided 
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from such analyses, and it may be possible to anticipate the final results sev- 
eral years ia advance of the conclusion of a complete cycle. If it is found that 
the wrong selection of treatments has been made after the experiment has been 
in progress for one or more years, Yates [331] suggests that it may be wise to 
abandon that experiment and to start a new one. 
In designing a long-term experiment a design should be used which permits 
the inclusion of additional treatments at a later stage, and provision should 
be made to include every course of the rotation in each period [46, 331]. 


XIV-3.1 SAME TREATMENT APPLIED ON A SINGLE CROP EVERY YEAR 


Certain agronomic experiments on a single crop are conducted over a 
period of several years. A treatment is applied to the same field plot year after 
year. Several such plots may be laid out side by side with the different treat- 
ments on different plots but with the same crop being used year after year on 
all plots. 

In an attempt to obtain information on the various treatment effects 
under a system of continuous cropping, Fisher [123, 124] applied the method 
of orthogonal polynomials to an experiment on wheat and involving various 
fertilizer treatments. He fitted the following polynomial to a long series of 
yield data: 


Y; = A + BX; + CX? + DX} + EX^-- FX5, — (XIV-38) 


where Y; = yield in year i, the X’s are the components of the rainfall curve, 
and the coefficients of the X’s are regression coefficients. Fisher’s method 
has been applied to numerous sets of data by various workers. However, since 
the equation is of limited usefulness in predicting the limiting value of yield 
and since the biological explanation of a mechanism which will generate a 
polynomial does not appear reasonable, some other function of time should 
be used to explain the yield from a plot in the ith year. For example, the fol- 


RA equation does approach a limiting value for given functions of А, B, 
and C: 


Y; = А + BCH, (XIV-39) 


Dutton [99] used this approach to describe the yields from continuous сгорріпё 
with fixed treatments over a period of several years. 

Other approaches (e.g., harmonic analyses) to the problem are possible 
but require further investigation from both the statistical and the biologic? 
viewpoints. After the analyses by the various methods and the interpretatio" 
of the results on several sets of data are performed, sufficient empirical eV” 
dence should be available to formulate some general principles. Even Wit 


present methods of analysis, much knowledge is to be gained from the analysis 
of data from long-term experiments, 


. Validates the estimate of the error variance 
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XIV-3.2 EXPERIMENTS ON A SINGLE ROTATION OF CROPS 
А An established agronomic practice is to rotate various field crops on a 
given plot. The particular rotation of crops might consist of corn, oats, red 
clover, red clover with one-fourth of the field planted to corn, one-fourth to 
oats, and one-half to red clover. A particular area would be planted to corn 
one year in four, to oats one year in four, and to red clover for the remaining 
two years. Such a rotation of crops as described above is called a four-course 
rotation. Two-course, three-course, etc. rotations are in common use in farm- 
ing areas around the world. One of the courses in a rotation might be a period 
of no-cropping. Various types of rotation experiments are discussed by Coch- 
ran [46], Crowther and Cochran [77], Yates [331], and in the references listed. 
Cochran [46] states that the most important rule about rotation experi- 
ments is that each crop in the rotation should be grown in each period. This 
should be accomplished even at the expense of replication on a single crop. 
Some replication on a crop should be made in order to perform an analysis on 
each crop for each year. If replication within a year is not possible, then the 
years may be used as replicates. Also, in order to facilitate cultural operations, 
all treatments in a replicate on a crop should be laid out in adjoining plots. 
This is called a series. (In a single year a series might be called a whole plot, 
since the design is comparable to a split plot design with the crops constituting 
the whole plots and the treatments constituting the split plots within a whole 
plot.) From a practical standpoint, it is advisable to have the replicates on а 
crop close together in order to facilitate cultural operations, but from the 
statistical viewpoint it is recommended that adjacent whole plots or series in 
a replicate be used. The latter layout usually allows observations over а 
wider range of conditions, since replicates which are farther apart tend to be 
more unlike than replicates closer together. Also, more precise comparisons 
among crops, if desired, are obtained with such a design. 
Example XIV-3. A long-term experiment in which the fertilizer treatments were 


discontinued is described by Cochran [46]. Previously, four treatments 4 = organic 
from corn meal, and C and D = arti- 


potash, and phosphate as found 


ficial man i i ts of nitro, 
ures with equivalent amounts of nitrogen plots on each of four 


in treatments А and B, respectively—had been applied to four 
Series with one crop of a four-course rotation п 
course rotation consisted of mangolds, barley, peas, and wheat grown on a plot in 
that order. The cycle was repeated in the next four years. The field plan for five years 


is given in table XIV-3. г , 1 

The object of the analysis of the yield data in table XIV-3is to examine the yields 

of barley (grain) to determine the persistence of effects from previous fertilizer treat- 

ments. In other words, the residual effects of treatments are compared over three 

complete cycles of a rotation after discontinuing previous fertilizer treatments, and the 
rate of deterioration in yields is examined. 

ceca ts (see field plan in table XIV-3) in- 


The systemati t of the treatmen! 
d Ee aioe of the average effects. Also, the comparisons 


30 


Linear trend = differences between 


TABLE XIV-3. Yields of barley grain in lbs per 14 acre, totals, and plan 


Yield data 
тї 


Previous treatments 
A B c D 
First cyc 


1886 
1887 
1888 
1889 


Second cycle 
1890 - 
1891 = 
1892 = 
1893 = 


Third cycle 


FUNE 


2 
6 
T 
8 


588 580 528 
555 370 h22 439 
360 361 255 


566 hog 


Series I Series II Series III 


Series IV 
Treatments Treatments Treatments 


Treatments 


Peas Mangolds 
Mangolds Wheat Barley 


Barley Mangolds Peas 
Peas Wheat 


yields on same plots in first and third cycles 
(pounds per 14 acre) X (—1) = decrease in yield 


Previous treatment 
B [d 


ho 35 19 
15 58 82 76 


75 7» 102 81 
hg 55 68 


Quadratic component of yields on same plot 
second cycle yield) — (first + third cycle yields) 


= 2 
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among series are partially confounded with comparisons among years. Nevertheless, 
the data are used to illustrate the analysis of a long-term experiment. 

The totals directly under the original yield data are merely the sums of yields for 
the three yields of barley grain in the years that barley occupied that particular plot; 
for example, 588 = 207 + 214 + 167. The other totals in this table are obtained 
Similarly. The first analysis of variance in table XIV-4 is for these data. The third 
set of figures in table XIV-3 represents the decrease in yield between the first and 
third years; for example, on the plot previously receiving treatment A in series IT 
the decrease in barley grain yields per À acre between the years 1886 and 1894 is 

-207 — 167 = 40 pounds of grain per } acre. There is a downward trend in yields 
after stopping the previous fertilizer treatment. The analysis of variance on these 
data is presented in the second part of table XIV-4. The divisor for the individual 
Squares is 2, since two original yields make up each difference. The last set of differences 
represents the curvilinear (quadratic) effect of the decrease. This is estimated by 2 
(yield from the second cycle) — (yield in the first and third cycles); for example, the 
first difference is equal to 54 = 2(214) — (207 + 167). The other differences are ob- 
tained similarly. The analysis of variance on these differences is presented in the last 
part of table XIV-4. The divisor for the squares of each of these differences is equal 
to (2? + (—1)? + (—1)?) = 6. The analyses on quadratic and linear regression on 
years follows the usual form described in standard textbooks on statistical methodology 
[273]. Also, the standard errors of the individual figures or means in table XIV-3 are 
computed in the usual manner. 

Owing to the nature of the design the estimated erfor for average treatment effects 
is to be questioned. However, if the error variance is considered as an estimate of the 
true error variance, there is no indication of real differences among the average yields. 
The totals for treatments A, B, and D seem surprisingly close together. ү 

Inspection of the linear trends indicates that the yields following inorganic manures 
deteriorated faster than those following organic manures. Àn examination of the 
quadratic regressions indicates that yields following treatments A and C were particu- 
larly well maintained during the second cycle of the experiment (1890 to 1893). f 

Cochran [46] discusses a more accurate estimate of the deterioration in yields 
than the one given immediately above. Let (1), (2), ***: (12) denote the yields de 
a plot receiving a specified treatment in the first, second, ***, twelfth years. du 
Dlot errors in the twelve years are independent and have equal variances, o.”, the best 
estimate of the average decrease in yield per year is given by 

L = йд) + 901) 4-700 +55 18) – 90) —10h — — 

With a variance equal to 02/143. Likewise, the above equation may be rewritten in 
the following form: 


L = qi [(3do + в — bo — 3ш) + 8а + bi +а + a), 
Where the following relationships hold: 
Totals 


=@+@0+{ «0-0 
b= @) +) +0] bi = 00) = 0) 
Co 
do 


Linear components 


= (11) — (3) 
= (3) + (7) + (11) а = (11) – ( 
= (4) + (8) +2) а = 012) – (0) 
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TABLE XIV-4. Analyses of variance (unit = 14 acre per year) 


Average effects of treatments 


Series 
Treatment 
Error 


Series 
Treatment 


Error 


Series 
Treatment . 218.8 
Еггог 539.5 59.9 


see | 


The quantity L’ = (a, + bj + c + d;)/32 estimates the decrease in yield per year 
and а, b, с, and d, are the quantities used in the analysis of variance for linear тев) 
sion on years. The variance of L’ is equal to GALE EET Be 


32 x 32 

= 74/128, given that the plot errors are independent. Therefore, ‘te fractional 1058 
in information due to using Z’ instead of L is (143 — 128)/143 = 15/143 = .105, oF 
a loss of about 10 per cent. Since the two quantities L’ and L” = (3do+ co — bo — 3a0)/' 30 
are subject to different variances the loss is somewhat less than 10 per cent. 
E fact, Cochran states that, little additional information is to be obtained in this 
experiment if L is used instead of L’, He presents a similar discussion for the quadrat 
regression on years. 


XIV-3.3 COMPARISON ОЕ SEVERAL ROTATIONS 


There are many types of rotations that are in agronomic usage. The com- 


AMET ү direct and residual effects from various rotations may d 
be foll Бр uir Ways. For example, uniform or continuous cropping coul 
Owed tor a time [331]; then, the various rotations could be started а0 
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conducted through one or more complete cycles, after which continuous crop- 
ping of the test crop could be followed to evaluate the residual effects of the 
various rotations. Yates [331] and Cochran [46] discuss this and other ex- 
amples. The reader is referred to these excellent papers for a clear and concise 
discussion of comparisons among rotation experiments and of the conduct of 
long-term experiments. к 

The field layout for the comparison of two three-course rotations is discussed 
below. Since the year-to-year variation is usually considerably larger than 
the plot-to-plot variation within a year, it is desirable to grow every crop in 
the rotation each year. Thus, if the two rotations consist of С = corn, О = oats, 
S = soybeans and С = corn, R, = first-year red clover, R, = second-year 
red clover, the following field plan for a single replicate might be used: 


Plot number 


The cycle is completed in three years. The crops on a given plot in the fourth 


year are the same as in the first year. i | 
A comparison of a two-course and a three-course rotation would require 


multiples of five plots and six years to complete a cycle. Thus, for two rota- 
tions, C = corn, О = oats and C = corn, O = oats, S = soybeans, one 
replicate of the field plan might be represented as follows: 


Plot number 


5 
о 
с 
о 
с 
о 
C 
о 


It is recommended that more than one replicate and more than one location 
be used for rotations of the above type. The sequences should be randomly 


allotted to the plots. Also, additional information on cultural operations, 
fertilizer treatments, etc. may be obtained by splitting the plots and applying 


n Ө 
е various treatments. 


CHAPTER XV 


Some Additional Designs 


À few additional designs are included in the present chapter. 'These de- 
Signs plus those in the preceding chapters include most of the experimental 
designs encountered in experimental work, Although the designs discussed 
below are seldom used in experimental work, they are included for complete- 
ness, for the infrequent occasions when they are encountered, and for use in 
understanding the design and analysis of other designs. А more complete 
account of the designs discussed herein may be found in the references cited. 


ХҮ-1 Three-Way and Higher-Way Grouping of 
the Treatments 


In the randomized complete block design, only a single grouping of the 
treatments is used. In the latin square design, two-way grouping of the treat- 
ments is utilized. Further stratification of the set of treatments is possible 
by grouping the treatments in other ways. The purpose of additional strati- 
fication is to control sources of variation that cannot be controlled by less 
stratification. 

In making use of the designs in this section, caution should be used in 
selecting the experimental material and in using experimental material subject 
to several major sources of variation. If the treatments interact with one 0! 
the sources of variation (say, rows or columns), the tests of significance for 
treatment differences may be invalidated because of the confounding of effects. 
Also, if there is an interaction between the types of stratification, tests of SIE 
nificance and estimates of effects are invalidated because of the confounding 
of effects, 

‚ Fisher [126, sec, 35.1] states that the principal use of graeco-latin а 
igher squares consists in clarifying complex combinatorial situations, bU 
2d Fon are occasionally useful directly in experimental work [98, 126, ay 

» 324]. Likewise, the main use of latin cubes, graeco-latin cubes, and hyP* 
graeco-latin cubes (126, 184, 186] is in the clarification of concepts associate 


with the more complex incomplete block designs and with complex eoa 
natorial situations, 
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XV-L1 GRAECO-LATIN SQUARES 

In an ordinary latin square design, each treatment occurs once in each row 
and once in each column. The treatments are usually designated by Latin 
letters, A, B, C, etc. А second set of variates, say Greek letters, o, 8, y, etc. 
may be superimposed on the Latin letters (treatments) in the latin square in 
Such a manner that the Greek letters appear once in each row, once in each 
column, and once with each Latin letter (treatment). For example, the only 
3 X 3 graeco-latin square possible is [126, sec. 35] 


The partitioning of the 8 degrees of freedom for the nine cells follows: 


Source of variation 


Row 
Column 
Latin letter 
Greek letter 


In order to observe the relationship of the above to confounding in fac- 
torial experiments, let А = 0, B = 1, C = 2, а= 0,8 = 1,7 = 2; then, the 
&raeco-latin square becomes 


The partitioning of the 8 degrees of freedom for the two pseudo-factors 


а and b is 


Source of variation 
Row = i АВ), (АВ?),, and (AB?) = AB? 
Салан comparison among (АВ A NO (AB) = in 
Latin letter = comp. among (A)o, (А) and (А) = А 
Greek letter = “ а ° (В), (В), and (В) = 


t2 r2 to ю | E 


w differences and the 
-latin square. The А 
letter effect. 


Thus, the AB? effect is completely confounded with го 
B effect with column differences in the above graeco 
effect is the Latin letter effect, and the В effect is the Greek 
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Nothing is left unspecified in the 3 X 3 graeco-latin square; this results in 
zero degrees of freedom and zero sum of squares for residuals. The above 
illustrates the orthogonality! of the two latin squares involved in the 3 X 3 
graeco-latin square and the analogy between the separate components of 
variation in the graeco-latin square and in the factorial experiment. 

Since there is no error term for a 3 X 3 graeco-latin square, it is necessary 
to use sets of 3 X 3 graeco-latin squares and to use the square by treatment 
interaction as the error for treatments. The analysis of variance for s 3 X 3 
graeco-latin squares is 


p à———————ÀÁM———ÁÁ— «ne y 


Source of variation df 
Square з= 1 
Column within square 2s 
Row р: $ 2s 
Greek letter within square 2s 
Latin “ (treatment) 2 
Treatment X square 2(s — 1) 
Total 98 — 1 


eee 


Though there is only one 3 X 3 graeco-latin square, there are seventy-two 
possible arrangements. The twelve possible arrangements for a 3 X 3 latin 
square (Chapter VI) and the three Greek letters may be permuted among 
themselves in six ways, resulting in seventy-two arrangements. In using the 
3 X 3 graeco-latin square, one of the seventy-two arrangements should be 
selected at random. An alternative method is to select at random one of the 
twelve arrangements of the 3 X 3 latin squares and then assign the Greek 
letters to the variates at random. 

As stated in Chapter VI, there are four standard 4 X 4 latin squares and a 
total of 576 arrangements [126]. Of the four standard squares, only one, 


"At this point the concept of o 1 7 The term 
has been used in Eae р! rihogonal latin squares needs to be defined. ( 


; ш hapters but was not defined.) Fisher and Yates [129] give the 
folowing definition of orthogonal latin squares and their мА to graeco-latin squares 
e a said ае аге orthogonal to each other if, when they are superimposed, every ис 
(one square bein, жий and only once with every letter of the other. Such a pair of BIA A 
Еа E n with Greek letters and the other with Latin letters) form а 618 
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yields a graeco-latin square [126, sec. 35]. There are two 4 X 4 graeco-latin 
squares from among the three orthogonal 4 X 4 latin squares [129]; thus: 


There are 144 = 413! arrangements for a 4 X 4 standard latin square, twenty- 
four ways of permuting the Greek letters among themselves, and two orthog- 
onal graeco-latin squares, resulting in a total of 6912 — 144 X 2 X 24 
arrangements, Їп laying out a 4 X 4 graeco-latin square, select one of the 144 
arrangements of the latin square at random. Then, select one of the two types 
of graeco-latin squares at random, and assign the Greek letters to the variates 
at random. This process assigns equal probability of selection to each arrange- 
ment. The analysis of variance breakdown of the degrees of freedom for the 
4 X 4 graeco-latin square design is: 


Eee ee 
Row 3 
Column 3 
Greek letter 
Treatment (Latin letter) Н 
c Ri (M аы 
15 


ra ee 


Of the fifty-six standard squares [129] for the 5x5 Jatin square, only six 
Standard squares yield graeco-latin squares; each of the six yields three dif- 
ferent squares which differ in ways other than a randomization of the Greek 
letters [126, sec. 35]. There are 3 X 6 X 24 X 120° 5 X 5 graeco-latin square 
arrangements. The randomization procedure follows that for the 4X 4 
8raeco-latin 

By еу all the actual types of squares, Fisher and Yates [128] 
have established the fact that no 6 X 6 graeco-latin square exists. They [129] 
list the two 3 x 3 orthogonal latin squares, the three 4 X 4 orthogonal latin 
Squares, the four 5 X 5 orthogonal latin squares, the six 7X 7 orthogonal latin 
Squares, the seven 8 X 8 orthogonal latin squares, and the eight 9 X 9 orthog- 
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onal latin squares. Graeco-latin squares have been constructed for all num- 
bers of treatments from three to thirteen with the exception of six and ten; 
Cochran and Cox [60] include plans for the3 X 3,4 X 4,5 X 5,7 X 7,8 X 8, 
9 x 9, 11 X 11, and 12 X 12 graeco-latin squares in their book. Graeco-latin 
squares exist for all odd numbers. It is suspected from the nature of the modulo 
notation that no 14 X 14 graeco-latin square is possible, since 14 is composed 


of mixed primes (i.e., 2 X 7 = 14) in the same manner as are the numbers 
2 X3 = 6 and 2 X 5 = 10. 


XV-1.2 HYPER-GRAECO-LATIN SQUARES 


More restrictions are placed on the grouping of the treatments (Latin 
letters) in the hyper-graeco-latin square than in the graeco-latin square de- 
sign. In the latter design each Latin letter appears once with each Greek 
letter. In the hyper-graeco-latin square design suffices (more alphabets [126]) 
are added and each of the letters appears once with each of the numbers. If no 
interactions among the treatments and sources of variation exist, the hyper- 
graeco-latin square design may be used to control the additional sources of 
variation. 

Fisher [126] forms the following 4 X 4 hyper-graeco-latin square by letting 
one of the sets of Greek letters in the two 4 X 4 graeco-latin squares be 
numbers or suffices 1, 2, 3, and 4: 


Again, it is possible to set up the analogy between the hyper-graeco-latin 
square and confounding in factorial experiments. The sixteen cells may be 
considered as the combinations obtained from four pseudo-factors each at tW? 
levels, or a 2* factorial arrangement, with the following scheme of association 
between the factorial arrangement and the 4 X 4 hyper-graeco-latin square: 
Ehe Eno epo tos 0 турег-втаесочайп эче шш 


Elfects in 2 Item of hyper- - Degrees of 
factorial graeco-latin square freedom 
A, BC, ABC Row 3 
B, AD, ABD Column 
C, BD, BCD 


3 
Latin letter 3 
D AC ACD Number 3 
‚ CD, ABCD Greek letter 3 
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Among the four row totals, there are 3 degrees of freedom which correspond 
to the 3 degrees of freedom associated with A, BC, and ABC; analogous 
relations exist between the other sources of variation. In laying out the 4 X 4 
hyper-graeco-latin square, one of the 2 X 144 X 24 X 24 possible arrange- 
ments is selected at random. Alternatively, one of the graeco-latin squares 
may be selected at random, and then the numbers are randomly assigned, 
Since there is no residual variation in the 4 X 4 hyper-graeco-latin square, 
it is necessary to use more than one square and to use the square by treatment 


interaction as the error variance to test treatment differences. The breakdown 


of the total degrees of freedom follows that outlined for sets of 3 X 3 graeco- 


latin squares. 
design of Greek letters and one suffix 


The 5 X 5 hyper-graeco-latin square 
[126, p. 16], where each treatment 


may be likened to the Knut Vik square 

(Latin letter) appears once in а row, once in a column, once in a diagonal 
in one direction, and once in a diagonal in the opposite direction; or it may be 
likened to a 5? factorial with the following association among the effects and 


the Latin letters, the Greek letters, and the numbers: 


Effect of 5? Item in hyper- Degrees of 
— — factorial graeco-latin square freedom 
A Latin letter 4 
B Greek letter — first diagonal 4 
AB Row 
AB? Column 
АВ? Number = second diagonal 1 


AB* Residual 


Other associations among the effects and the categories of the hyper-graeco- 
latin square may be substituted for the one above, since the effects are pseudo- 
effects. The association presented above illustrates the relationship puo 
the factorial experiment and the hyper-graeco-latin square. Ifa second set о 


numbers (or suffices) is used, it corresponds to the АВ“ effect. m aded 
in the 5 X-5 latin square are speci ero degrees ot 1r 


fied; there are 2 
А A е of more than one 
ero sum of squares for the residu: s 


al, necessitating the u mes 
i i ment, dif- 

Square in order to obtain an error mean square for comparing treat 

ferences, 


i i es for a 
_ Nissen [232] has shown that there are two 5 X 5 к үн ori de 
given assignment of the Latin letters to the treatments. аел: 
те randomly allotted to the treatments, there лыш е 
е Knut Vik square. A random selection of one of the arra з 2 
‚ There are a ictal of 6x6x24X 1205 X 5 Hyper graeco o о 
with Latin letters, Greck letters, and numbers; there are E zw еч ү 
ifferent hyper-graeco-latin squares for Latin letters, Gree: ema 
number suffix, and second-number suffix. In setting ир an experimen gn, 
Опе of the possible arrangements is randomly selected. 
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Tn all graeco-latin squares and hyper-graeco-latin squares, randomizations 
may be obtained by randomizing all rows, all columns, all Latin letters, all 
Greek letters, etc. of a given square. For larger squares, some of the arrange- 
ments will be excluded by this method, but the number of arrangements pos- 
sible will probably be large enough unless these designs are used extensively 
and unless it is desired to summarize the results from all such designs. 

Graeco-latin square and hyper-graeco-latin square designs may be particu- 
larly effective in controlling and locating sources of variation in various fields. 


XV-L3 LATIN AND HYPER-GRAECO-LATIN CUBES AND HYPERCUBES 


Although the latin squares and hyper-graeco-latin squares (including the 
graeco-latin squares) were first studied in the latter part of the eighteenth 


Diagrammatic representation 
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pendently published on the completely orthogonalized hyper-graeco-latin 
cubes and hypercubes of first order; he was investigating systems of confound- 
ing in the general symmetrical factorial design at the time. Shortly after this 
(1946-47), Rao defined certain combinatorial arrangements, which he called 
hypercubes of strength d, and showed that for d — 2 the hypercubes are 
exactly equivalent to latin and hyper-graeco-latin cubes and hypercubes of 
the first order [186]. 

The natural extension of the latin square and the hyper-graeco-latin 
square is to a latin cube and to a hyper-graeco-latin cube. The extension from 


Diagrammatic representation 


Y 
Schematic representation 


or to the hypercubes. Instead of considering 
k? elements as in a latin square, А? elements are considered. There are k layers 
of k? elements with the three sets of such layers intersecting at right angles. 
Thus, any two layers of the different sets will intersect in k common dman 
(figure XV-1). Kishen [186] defines a latin cube of first order of edge k as a 


cubes is to higher dimensions, 
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cube arrangement of k letters, each repeated Ё? times, such that each letter 
occurs exactly k times in each of the three sets of planes parallel to the three 
coordinate planes OY;Y;, OYiY, and OXY. The diagrammatic and 
schematic representations of this are given in figure XV-1. The schematic 
form of presenting the 3 X 3 X 3 latin cube of first order is more convenient 
and less cumbersome than the former. The analogy between the rows (or 
columns) in the squares and the sides of the cube is evident from an 
examination of figure XV-1. 

A latin cube of second order of side k is defined to be a cube arrangement 
of k? letters, each repeated k times, such that each letter occurs exactly once 
in each of its three sets of planes parallel to the coordinate planes ОҮ, Үз, 
OYiY;, and OY2Y;. Thus, in a latin cube of second order, there are k? letters 
instead of letters as there are in a latin cube of first order. The diagrammatic 
and schematic arrangements for a 3 X 3 X 3 latin cube of second order are 
given in figure XV-2. 

A further discussion of rth order latin cubes and latin hypercubes and of 
their use in the theory of general symmetrical factorial arrangements and in 
balanced incomplete block designs may be found in Kishen's paper [186]. 
This reference plus the others listed in this section discuss hyper-graeco-latin 
cubes and hypercubes as well as orthogonality among the cubes [see 113]. 


XV-2 Quasi-Latin Squares 
XV-2. CONFOUNDING IN LATIN SQUARES " 


The subject of confounding in the randomized complete block design has 
been discussed in previous chapters. Only brief mention of confounding in 
the latin square design has been made thus far (sections IX-3.2, X-1.4, and 
XIII-3; problems X-7 and X-9). This subject is important in experimental 
design because it may be necessary to use two-way elimination of experimental 
heterogeneity and to use more treatments than there are rows and columns in 
the given latin square. (In some experimental situations the lattice square 
design may be appropriately used.) For certain treatment combinations, it is 
possible to arrange the treatments in a latin square design in such a way that 
the less important interactions are confounded with row and column dif- 
ferences. Since these designs resemble ordinary latin squares in that row and 
column differences are eliminated from the experimental error and since they 
differ from latin squares in that each treatment does not appear once in a row 
and once in a column, Yates [324] proposed the name quasi-latin squares for 
these designs. He constructed a number of the designs and presented appro- 
priate analyses. Їп laying out the designs, all rows and all columns must be 
rearranged at random in order to obtain an unbiased estimate of the error variance. 
This is true for all quasi-latin squ; 


F ares includi "plai id latin 
squares discussed in the following аалы, жеры ri i 
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Prior to selecting a particular quasi-latin square design the experimenter 
should note which effects are confounded with row differences. Also, an outline 
of the procedure for computing the sums of squares in the analysis of variance 
should be made prior to conducting the experiment. The latter information 
points up the difficulties associated with the designs, the effects confounded, 
etc. The procedure for computing the various sums of squares may be ob- 
tained from the results presented in Chapters VI through IX. Plans for the 
various quasi-latin squares and key-outs for the degrees of freedom in the 
analysis of variance are tabulated in various places [see 60, 140, 175, 253, 324]. 

The design for and the analysis of a 2? factorial in two 4 X 4 latin squares 
with the columns of the two squares interlaced is discussed in section IX-3.2. 
The three-factor interaction is completely confounded with square differences. 
Various alternative schemes of confounding the effects from a 23factorialin4X4 
latin squares may be used. For example, the following plan of partially con- 
founding the interactions with row and column differences might be utilized: 


Square I Square II 
(BC), (BC), (AB) (AB) (АВС), (АВС), (AC) (AC) 


(АВС), 011 110 000 101 (ВС), 011 11 000 100 


(АВС), 111 010 001 100 (ВС), 110 010 101 001 
(АС), 000 101 11 010 (АВ), 000 001 11 110 


(АС), 100 001 110 011 (АВ), 101 100 010 011 


The key-out for the degrees of freedom in the above design is 


Source of variation df 


Square 

Rows within square 
Columns “ T 
Main effects (A, B, C) 
AP' 

AC’ 

BC’ 

ABC’ 


Residual variance 


See кы ы со о о 


m 


Total 
tit ow.) pe Totals yt „ДН ee Le I БаНе 
All interactions are partially confounded with row and column differences. 
Only one-half of the relative information is retained on the partially con- 
founded effects. Consequently, unless the interactions are of little importance 
or unless the added stratification controls a considerable amount of the residual 
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variation, it may be well to use an incomplete block design with plots of four 
treatments; if each interaction is confounded with incomplete block differences 
in one of the four replicates, three-fourths of the relative information is re- 
tained on each interaction (section IX-4.1). 

If three 4 X 4 latin squares were to be used with 23 treatments, the follow- 
ing additional square could be added to the above two: 


a L _... 
Square III 
(A)o (А), (АВС)» (АВС), 


(B)o 001 101 000 100 
(В), 010 110 011 111 
(C)o 000 100 110 010 
(C) 011 111 101 001 


The above design allows for five-sixths relative information on the main effects, 
two-thirds relative information on the two-factor interactions, and one-half 
information on the three-factor interaction. The key-out of the degrees of 
freedom in the above design is 


= RENE 


Source of variation df 
SS l 8 
Square 2 
Rows within square 9 
Columns within square 9 
Main effects (A’, B’, C’) 3 
Two-factor interactions (AB', AC’, BC’) 3 
Three-factor interaction (ABC’) 1 
Residual 20 
SS eee 

Total 
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Other schemes of confounding could be used. The three-factor interaction 
could be completely confounded with rows in all three squares, A and BC 
with columns in square I, B and AC with columns in square II, and С and 
AB with columns in Square III. Also, the three-factor interaction may be 
eee ely confounded with rows in all three squares and one of the two- 
yale interactions may be completely confounded with columns in one 9 
5 MB The former design allows for no information (i.e., intrablock) on 

E effect and for five-sixths information on the other effects. The latter 
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Similar schemes of confounding in latin squares with 3", 4", 5", etc. treat- 
ments are possible. 'The reader is referred to Yates [324] for a more compre- 
hensive discussion of confounding in latin squares. 


XV-2.2 HALF-PLAID LATIN SQUARES 


The half-plaid latin square is a quasi-latin square in which a main effect, 
instead of an interaction, is confounded with row differences. The levels of 
the factor, which are applied to the rows of the latin squares, should be 
allotted at random. No grouping of the treatments is permissible if an unbiased 
estimate of the error variance is to be obtained. Thus, the design for the treat- 
ments applied to entire rows in the latin square corresponds to a completely 
randomized design. The additional degrees of freedom in the error variance 
for the treatments applied to the rows somewhat compensate for the inefficient 
design [324]. 

The following example is used to illustrate the design and analysis for 
the half-plaid latin square. Suppose that four levels (21, v2, vs, and v4) of the 
factor v, or four varieties, are applied to the rows of an 8 X 8 latin square. 
The varieties are randomly allotted to the eight rows of the square. The 2? = 8 
treatments within the 8 X 8 latin square are randomly allotted in the manner 
described in Chapter VI. The following field arrangement represents one of 
the possible layouts for a 4 X (23) half-plaid latin square (the notation p X (k) 
is used to denote the fact that the p levels of a factor are applied to the whole 
rows and the k = 2? treatments are the treatments within the k X k latin 
Square): 


If E is set equal to the comparison —?: — v2 + vs + 04, if D is set equal to the 
comparison —ю; + v; — v3 + v, if DE is set equal to the comparison v; — v; 
— Vs + v, and if the levels of the three factors а, b, c correspond to the first, 
Second, and third numbers, respectively, the ABE, BCD, and ACDE effects 
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are confounded with column differences. The key-out for the degrees of free- 
dom in the analysis of variance is 
Source of variation df ms 
Row 1 
Variety 
Residual 
Column 7 
Confounded effects (ABE, BCD, ACDE) 
Residual 
Sub-plots 49 
Treatment 1 
А 
B 
AB 
с 
АС 
BC 
ABC 
Variety X treatment 18 
Variety ХА 
Variety X B 
Variety X C 
Variety X AB’ 
Variety X AC’ 
Variety X BC' 
Variety X ABC 
Error 24 E 
Total 63 


© 
t 
E 


mo 
p 


——————- 


оз кю кә № WWW 


The error mean square E, may be used to test the treatment and the treat- 
ment X variety effects if only these varieties and levels of а, b, and c are con- 
cerned in making inferential statements about the treatments. Likewise, Ё: 
and E, represent the error mean squares for comparison with the mean squares 
for confounded effects (ABE, BCD, and ACDE) and for varieties, respectively. 

If two such squares are used with different randomizations, the following 
breakdown of the degrees of freedom in the analysis of variance is made: 


Rows within square 14 
Variety 3 
Variety X square 3 
Residual 3} 

Columns within square 14 
Effects confounded with columns 3 
Effect X square 3 
Residual H 

Sub-plots within Squares 98 
"Treatment 
Treatment x square 
Treatment X variety 


Treatment X square X variety 18’ 


Residual within Square 48 Ee | 
ae es pw 17.7 dd _ ' E = | 


127 


E, 
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The above designs are useful in applying additional factors to a latin 
square design that has already been laid out in the field or laboratory. For 
ехатріе,! an 8 X 8 latin square design on eight cabbage varieties might be laid 
out. Later in the season, information might be desired on four fertilizers (or 
four insect sprays).With proper precautions in applying the fertilizers to specific 
rows in the latin square, additional information is obtained on the effectiveness 
of fertilizers as well as on the relative yields of the eight cabbage varieties. 


XV-2.3 PLAID LATIN SQUARES 

If one set of factors is applied to the rows and a second set of factors is 
applied to the columns of a latin square design, the resulting quasi-latin 
square is called a plaid latin square. Yates [324] used this term because the 
resulting square after randomization resembles a “typical Scotch plaid." 
The relationship of the half-plaid latin square to the plaid latin square is 
evident and is the reason for the name, half-plaid. : 

The notation for the plaid latin square represents an extension of that for 
the half-plaid latin square. For example, a 2 X 2 X (2?) plaid latin square 
is one with two levels of one factor, say a, applied to the columns, two levels 
of the second factor, say b, applied to the rows, and the 2? treatments, say 
two levels each of the factors c and d in all combinations, arranged in a 
4 X 4 latin square; the schematic arrangement is 


(А), 
0000 | 0001 | 1010 | 1011 
(B)o 
0011 | 0010 | 1000 | 1001 
1100 
1110 


1M, T. Vittum, Geneva Experiment Station, N. Y., used such a design. 
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The effects confounded with the various columns and rows are (indicated by 
the symbol х) given below. 


Row number Column number df 
кес 1 2 3 4 1 2 3 4 confounded 
A x x x x 1 
B XX XU X! cx 1 
AC С 5 xe X 1 
BD хе ox 4 
ABC x xc > x 1 
ABD x x i 
BCD x x 4 
ABCD x x i 
Total df confounded 6 


"Thus, effects B, AC, and ABC are completely confounded with row differences 
and A with column differences. Effects BD, ABD, BCD, and ABCD are par- 
tially confounded with column differences with one-half relative information on 
each. The degrees of freedom confounded add up to the 6 degrees of freedom for 
rows and columns. In order to obtain an error variance for some of the effects, it 
isnecessary to use more than one 4 X 4 square. The reader is referred to the 
paper by Yates [324] for a more comprehensive treatment of these designs. 


XV-3 Magic Latin Squares and Super Magic 
Latin Squares! 


In certain instances it may be desirable to place additional restrictions on 
the grouping of the treatments within a latin square design. For example, in 
а k X klatin square, ће k treatments might be grouped into r X s = k smaller 
Squares or rectangles. Also, the grouping might be in more than one direction. 


The 6 X 6 magic latin square listed below illustrates the arrangement of 
treatments.? 


1Th 1 " ina; 
unpublished results,” 19 designs were developed by G. М. Cox, Univ. of North Caroline 


*This square has been used Ь К, 
М.Т. Vittum, Geneva Experiment Sick 


Borden, Hawaiian Sugar Planters’ Assoc., and bY 
оп, in field experimentation, 
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In the above square, each treatment appears once in a row and once in a col- 
umn, and all treatments appear together in a given row or a given column. 
Thus far the plan is a typical latin square arrangement. In addition, all six 
treatments appear within each 3 X 2 rectangle marked by the dotted lines. 
This further restriction on the allocation of treatments must be taken into 
consideration when making an analysis. 

The randomization procedure is to assign the treatments to the letters 
at random, to assign the sets of three columns to the left part and to the right 
part of the experimental area at random, to randomize the three columns 
within each set, to randomize the sets of two rows, and then to randomize 
within the sets of rows. All treatments then fall together in a row, in a column, 
and in a 3 X 2 rectangle. 

The breakdown of the total degrees of freedom in the analysis of variance 
for a 6 X 6 magic latin square is 


HH OZO 


Source of variation df 
Row 5 
Column 5 
Treatment 5 
Rectangle (elim. row and col. effects) 2 
Error 18 
Total 35 


The total, row, column, and treatment sums of squares are computed in the 
same manner as previously described for the latin square design. The remain- 
ing sum of squares with 20 degrees of freedom is partitioned into two parts, 
one with 2 degrees of freedom representing the sum of squares among rectangles 
(eliminating row and column effects) and the other with 18 degrees of freedom 
representing the sum of squares for the experimental error. 

An examination of the following design is helpful in formulating the analy- 
sis of variance for the design: 


Column number 


3 | 4 


Row no. 


Rectangle 1 Rectangle 2 


2 Total = X... 
3 Rectangle 3 Rectangle 4 
4 Total = X... Total = X... 
5 Rectangle 5 Rectangle 6 


Total = Х..., 


Total = X...5 
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The sum of squares among the totals, X... (let X;;,, = yield of a single 
observatión), for the six rectangles may be partitioned into the following five 
orthogonal contrasts: 


Ry = (Xia + Xia + Xa) — Quad Xt X4, (KV-1) 


Ry = (Xa + Х..з) — (Xu. + X.a), (XV-2) 
Ra = (Qa Xie + Xa XL) — UK 4X, (КУЗ) 
В, = (Xia X) — (Xia + Х..з), (ХУУ) 


апа 
Bs = Qa Xa 2X4) – (Xia + Xie 2X4). (XV-5) 


The contrast R, represents a column contrast; Вә and R; represent contrasts 
among row totals. Only the contrasts R, and R; represent comparisons which 
are not row or column comparisons. Since the contrasts are orthogonal and 
Since each is associated with a single degree of freedom, there are only two 
degrees of freedom associated with the sum of squares, 
Rè Rè 
ва+іғі+р= иена 14у (XV-9 

among rectangles (eliminating row and column comparisons). The divisors 
are obtained as the sum of the squares of the coefficients times the number of 
items making up each total X ee 

The error variance is obtained by subtraction; thus, total ss — 
+ column ss + treatment ss + rectangle (eliminating row and colu 
= error ss. Computation of standard errors and of tests of significance proceed 
in the same manner as for the ordinary latin square design. 


Still further grouping of the treatments may be made; 


(row ss 
mn) ss) 


thus: 


BAG UE 


M - 
гт 
* : 


In the above super magic lali; 


n square the six t; 
a column, in a 3 X 2 rectan 


{ reatments all appear in a row, in 
gle (indicated by 


Safani ), and in a 2 X 3 rectangle 
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(indicated by «++++). The additional restriction on the allocation of the 
treatments must be accounted for in the analysis. With proper randomization 
the breakdown of the degrees of freedom in the analysis of variance is 


Source of variation df 


Row 5 
Column 5 
Treatment ` 5 
Residual 20 
3 X 2 rectangles (ign. 2 X 3 rect.) 2 
2 X 3 rectangles (elim. 3 X 2 rect.) 1 
Error 16 


Total 35 
ee ee ————— 


The sum of squares for the 3 X 2 rectangles is given by equation (XV-6). 
Contrasts similar to those given in equations (XV-1) to (XV-5) may be set 
up for the 2 X 3 rectangles; thus: 


Sy = (X... Xia + Xia) — (Ca Ket Х....), (ХҮ-7) 


S, = (X... + X.) — (XK... + X....5) (ХҮ-8) 
S = (Х.Х... ЕХ... БХ...) — 2(X ad АХ...) (XV-9) 
S, = (Х...а + X...) — (X... + Х....), (XV-10) 


and 
S, = (Х...1 + Х...2 + 2X...) — (2Х...» + Х...4 + X....5), (XV-11) 


where X...., represents the total for the six individual yields, X;j,,, in the 
2 X 3 rectangles: 


Column number 


The contrasts S; and Ss are the only ones which do not represent row or 
column comparisons. Therefore, the sum of squares due to the 2 X 3 rec- 
tangles (eliminating row and column comparisons) is given by the expression, 


(= = Sy (XV-12) 


with one degree of freedom. The total, row, column, treatment, and rectangle 
(3 X 2) sums of squares are computed in the same manner as for the magic 
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latin square. The error sum of squares with 16 degrees of freedom is obtained 
by subtraction. 

Other groupings of the treatments, such as graeco-magic-latin squares, are 
possible. It is doubtful if these designs will attain prominence in experimental 
work, since the additional stratification of the treatments over the latin square 
will not control any considerable amount of the heterogeneity in most experi- 
mental situations. Unless the experimenter is certain that a relatively large 
amount of the extraneous variation will be controlled by the additional group- 
ing, it is recommended that the latin square design be used instead of the magic 
or super magic latin square designs. 


XV-4 Weighing Designe 


Several papers [7-10, 162, 174, 185, 211, 249, 254, 319] have been written 
on the problem of weighing several light objects on a chemical balance, spring 
balance, or other weighing device. This problem was first introduced by 
Yates [319]. He states that the information obtained by weighing each of 
seven objects separately and then making an eighth weighing with no object 
to determine the zero correction is only one-fourth that obtained by weighing 
the seven objects according to the following design: 


Weighing Objects weighed 


a, b, c, d, e, f, g 
a, b, 
а, с,е 
а, f, g 
b, c, f 
b, e, g 
c, d, g 


d, e, f 
-= T ©. * 


This illustrates the great gain in efficien 
appropriate design. In the weighin, 


ONDUA WN = 


cy that may be made by using the 
g design described by Yates, every object 
is weighed four times and every object is weighed twice with every other ob- 
ject. The difference between the first four and the last four weighings is an 
estimate of four times the weight of object a. 


Р The last seven weighings represent a balanced incomplete block design 
withv=7,b=7,r= 3, k = 3, and y = 1. Also, the above weighing design 
may be related to the estimation of effects in the 22 factorial when the factors 
do not interact. If the weighings 1, 2, 3, .. *, 8 are numbered 111, 110, 101, 
100, 011, 010, 001, and 000, respectively, then the weight of the object a 18 


үне by the comparison giving the A effect. Also, the estimates of the 

ку ts of the objects b, c, d, e, f, and g are given by the estimates of the effects 
› C, AB, AC, BC, and ABC, respectively. 

bovecited references for a more comprehensive 


nd of weighing designs for other situations. 


The reader is referred to thea’ 
discussion of weighing designs a 


CHAPTER XVI 


Covariance 


XVI-1 Introduction 


The methods for reducing or controlling experimental variation are 
[126, 127, 218, 273]: 
(i) Selection of homogeneous material and (or) of a uniform environment. 
(ii) Stratification (grouping) of the material and (or) of the environment into 
homogeneous subgroups. 


(iii) Refinement of experimental techniques. 
(iv) Measurement of related variates and use of covariance analysis. 


If the above procedures fail to furnish adequate control of the experimental 
variability, then the number of replicates will need to be increased to the 
point where adequate precision is attained. 

Local control (methods (i) and (ii)), randomization, and replication form 
the basis for valid experimental designs. If the material and the environment 
are uniform, then a completely randomized design is used. If the material 
and the environment are grouped into homogeneous subgroups with enough 
material in each subgroup to compare all treatments, a randomized complete 
block design is used. If the subgroups are not large enough to compare all 
treatments, an incomplete block design is used. Other situations may require 
the use of additional grouping of treatments in order to attain homogeneity 
of the material in the subgroups (e.g., the latin square design). 

Some discussion on the refinement of experimental technique (method 
(11)) is given in Chapter III and elsewhere in the text. In most instances 
the variability associated with a given technique is relatively small in ex- 
perimental work. However, cases do arise in which technique errors add 
considerably to the variability. Technique errors should be guarded against, 
but they should not be overemphasized. 

The use of concomitant observations or covariates may be effective in 
reducing the experimental variability over and above the reduction obtained 
by stratification of the experimental material. Alternatively, the use of re- 
lated variates may be substituted for one or more of the other methods in 
certain experimental situations. A rule to follow here might be: if the experi- 
mental variation cannot be controlled by stratification, then measure related 
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variates and use covariance. Also, it may be more advantageous to use covari- 
ance than to use stratification, since fewer degrees of freedom are usually 
required to control the variation. On the other hand, stratification may be 
used to control variability even when no measurement of variability is 
available, whereas covariance could not be used. 

The covariates in covariance analyses should be selected with care, and 
the results should be interpreted in light of the particular covariates used 
and of the particular values of the covariates obtained [13]. The effects 
eliminated should be irrelevant to the objectives of the experiment, and the 
tests of significance of treatment comparisons should not be invalidated [126, 
sec. 55]. In a large number of experimental situations the differences among 
treatment means for the covariates should be random differences and not treat- 
ment differences. If the treatment means for the covariates represent real 
treatment differences, then it may not be advisable to use covariance analyses. 
Instead, a multivariate analysis may be required to assess the true treatment 
differences in the observed characters. The particular analysis to be used 
depends upon the nature of the experiment. 

In order to use the average within regression, the individual regressions 
must be estimates of the same regression. Also, the individual treatment 
variances adjusted for covariation with the independent variates must form 
a homogeneous set of variances if the pooled variance is to be used in tests 
of significance. In addition, the deviations from regression should (theo- 
retically) be normally and independently distributed in order for the resulting 
i, F, and x? tests to be valid [126, 127, 273]. 

In some experimental situations the regression of Y on X will vary from 
treatment to treatment [158]. The use of an average regression in such cases 
is often misleading. In the analysis of data in which the treatment regres- 
sions differ, the standard errors for treatment differences become rather com- 
plex; also, the use of covariance is to be questioned. The fact that the treat- 
ment regressions differ may be sufficient reason for conducting additional 
experiments on the subgroups of treatments which do have common regressions. 

Covariance analyses for a number of the designs discussed in previous 
chapters are illustrated below. The covariance analysis for other designs is 
a straightforward extension of the methods presented herein. In addition to 
covariance analyses for various designs, examples illustrating the control of 


уша tion by covariance instead of by stratification, and other selected topics 
are included. 


XVI-2 Completely Randomized Design 
{ The covariance analysis for a с 
in most textbooks in statistics a 
classification or as 


ompletely randomized design is described 
S the covariance analysis for a one-Way 


an “ ар s 3 
among groups and within groups covariance analysis- 
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Snedecor [272, 273] presents several examples and discusses the results for 
several covariance analyses [also see 58, 80, 84, 86, 119, 126, 127, 158, 218, 
252, 308, 314]. 
The completely randomized design consists of v treatments randomly 
allotted to the г. = Jor: experimental units with г; replicates on the ith 
i=l 
treatment. The linear model for covariance 15! 

Ү;= ud ri + В(Х; — 2) + 6 (XVI-1) 
where Y;; = the yield of the jth observation on the ith treatment, и = popu- 
lation mean, т; = effect of ith treatment, Ху; — 2 = deviation of the ijth 
covariate X from the mean of the covariate, 8 — common regression for all 
treatments, and є;; = random error component. 


The key-out for the degrees of freedom and for the sums of squares is 
presented in table XVI-1. The sums of squares for both the variate Y and 


TABLE XVI-1. Covariance analysis for a completely randomized design 


Sum of products Aàjusted ss 
Source of variation 

Ге Ги s s^ zer] GREEN 

rel ; 


Total 


Group 


Within 


Group adjusted for the average 
within group regression 


the covariate X are obtained in the usual manner for a completely randomized 
design, The sum of the cross products of Y and X is obtained as follows: 


т = УУУХ ОЛКЕ, (XVI-2) 
i=l j=l 

Xi. Yi. XVI-3 

a = EDS Хез (XVL3) 


and P 
Еш = Ta ~ Gs {хок " Жыл. (XVI-4) 


The average within group linear regression coefficient is estimated by 


bye = E./E:. (XVI-5) 


Ivo] T S 2 Aby 
and the average within group correlation is estimate 
гу = Е/М ЕЕ (XVI-6) 
ЧЕ desired, the constant 82 may be included with и, and an estimate of p’ = и — BE 
may be obtained in place of the estimate for и. 
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The reduction in the error sum of squares due to fitting a linear regression 
coefficient is equal to x 
EE, = г,2Е, = Ь,?Е,.., (XVI-7) 
with one degree of freedom. 
The F test for the comparison of the group mean square adjusted for 
within-group regression is 
ССр 9 — 1) XVI-8) 
dua) Seay ( 
The group (treatment) means adjusted for the average within-group regression 
are obtained from the formula, 


dí = й:— М — 2). (XVI-9) 

The adjustment b,.(z; — 2) is subtracted from the unadjusted mean, 7:, to 

obtain the adjusted treatment mean. И 

The estimated variance of a difference between two adjusted means is 

[119, 218, 314] 

E. ас z») 1-10 

RUD To Ezz > av ) 

where rı = number of replicates on treatment one, r; = number of replicates 

on treatment two, 2, = mean of X values for treatment опе, z; = mean of 
X values for treatment two, and f. = degrees of freedom for E 

The above covariance analysis pertains to a single group of treatments. 

In many situations the treatments may be partitioned into subgroups, or 

the treatments may form a factorial arrangement. The covariance analysis 


for such situations is presented in table XVI-2. In the table, 
(A, + By)’ = Ay, + Eyy — (As, + E4/ (Az + Ez). (XVI) 


The other two sums of squares, (By, + Е.) and (Ry, + E,,)', are similarly 
obtained. The tests of significance and adjustment of the means proceed in 
the same manner as described previously. 


When several lines in the analysis of variance are to be adjusted for the 


error regression, the adjusted sum of squares may be evaluated as Ay, — 2bAzv 
+ UA, where b = 


E;,/ E. and the other values are defined in table XVI-2 
[60, sec. 3.86; 127, sec. 49.1]. An adjusted sum of squares for A, say, obtained 


ч this manner is equivalent to the sum of squares among adjusted means; 
thus: 


А» — 9A. + PAs. = MY. ~ 9 b. — py, (KVA) 
pores m number of individuals in Y:.. and the other symbols are as defined 
above. Since this sum of Squares contains a component due to error of ad- 


justment, it is not Strictly comparable to Ey, = Ey, — Е. 2/Е,, for testing 
the null hypothesis. The resulting F values will be too high; the amount 0 
overestimation in F depends upon the differences between treatment means 
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TABLE XVI-2. Covariance analysis for subgroups of treatments in a completely ran- 
domized design 


— 0 
2 zo s Alen 
T, T, T, - 


ху хх 


Residual 
(Interaction) 


Error 


УУ 
Error + A + 
A adjusted É by subtraction 


Error + B 


B adjusted by subtraction 


sidua R R +5 )' 
Error + residual R ( yy" и) 


Residual adj. by subtraction 


for the covariate. Since this component of error is relatively small in a number 
of cases, the above method of obtaining the adjusted sums of squares is useful. 


XVI-3 Randomized Complete Block Design 


The linear model for a covariance analysis for a randomized complete 


block design is , 
Yy ud rid pi + BG; — €) + 6i (XVI-13) 


where x, ту, ру, В, and ei; are the mean effect, the effect of the ith treatment, 
the effect of the jth replicate, the average regression coefficient from the error 
line, and a random error component, respectively. The quantity QG; — 2) 
represents the deviation of the observed X;; from the mean of the experiment 
for the covariate. : 

In the above linear model the assumption is made that all treatments are 


r variation after correction for regression. Also, it is 


subject to the same erro K 2 
assumed that the error regressions for the different treatments are all esti- 


mates of the same parameter 8. Àny effect of the treatments on the mean of 
the covariate must be considered when interpreting the results from a co- 


variance analysis. 


The analysis of covariance for a randomized complete block design is 
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presented in table XVI-3. The average within treatment and replicate re- 
gression, £, is estimated by the quantity, 


b, = Ezy/ Ezz. (XVI-14) 


The reduction in the error sum of squares due to fitting a linear regression 
of the covariate X on the variate Y is obtained from equation (XVI-7), and 
the adjusted treatment means are obtained from formula (XVI-9), using 


the value of b obtained in equation (XVI-14). Tests of significance are made 
in the manner described in the preceding section, 


TABLE XVI-3. Covariance analysis for a randomized com 


plete block design 


Source of 
variation 


Вери. - R 
leplicate r-l Ry Ly, Re 
Treatment v-l 


Error (r-1)(v-1) 


2 
yy 
t= 
Treutment M (тууу) Ty yy 
* error E 2 
(туву) /(т Е) 


The treatment sum of s 
parts, and each part ma 
procedure follows that o 

Snedecor [273, sec. 
analysis. The yields, Yi, 


the increased tonnage per acre, 
If several samples or individuals are ob 
the problem of which regression to use 
random samples are obtained from each 
model is of the form, 
Yin = p+ Ti + p; + B(X dev) + є; + Din, (XVI-15) 
where и, 7;, and p; are defined in 
є; represents a random componen 


tained from each experimental unit, 
arises. To illustrate, suppose that $ 
experimental unit and that the linear 


the same manner as in equation (XVI-13) › 
t associated with the i ijth experimental unit, 
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0j, represents a random component associated with the hth sample in the 
ijth experimental unit, 8 is the regression coefficient, and the (.X dev) depends 
upon the regression used. The (X dev) in ordinary field experiments 


is equal to! 
Cu et жы} _ {ле 2 
(X dev) c { $ CENT = E — Zz, (XVI-16) 
where Х;;. = experimental unit total, X... = experiment total, г = number 


of replicates, v — number of treatments, and s — number of samples per 
experimental unit. The computation of the regression coefficient from the 
experimental error in the analysis of variance applies to most experimental 
situations and is similar to the preceding example. However, there are situa- 
tions in which the regression is computed as the average regression within 
experimental units. In this case the (X dev) is equal to 


(X dev) = Xi, — z. (XVI-17) 


The (X dev) in formula (XVI-17) corresponds to the (X dev) given for the 
completely randomized design with the treatments in a factorial arrangement 


(table XVI-2). 
XVI-4 Latin Square Design 


The covariance analysis for the latin square design is a straightforward 
extension of the results given in the previous sections [127, sec. 49; 273, Ch. 
12]. The linear model for the yield of the ijhth observation from a latin square 


design with a covariate is 
Yin = ud ret тк B(Xis — 2) + ej» (XVI-18) 


where и = mean effect, y: = effect of ith column, p; = effect of jth row, 
74 = effect of hth treatment, @ = regression coefficient of residual Y varia- 
tions on the residual X variations, and ел = random error component. The 
key-out for the degrees of freedom and for the sums of squares in the covari- 
ance analysis of an experiment arranged in a k X k latin square design is 
given in table XVI-4. The average error regression, 8, is estimated by the 


quantity, 
b = EQ/E« (XVI-19) 


the average correlation of the error deviations i$ estimated by 
nA— E. EE, (XVI-20) 


are obtained from formula (XVI-9), using 


The adjusted treatment means 
). The variance of a difference 


the value of b obtained in formula (XVI-19 
e of the covariate is available on the experimental unit 
thin the unit (e.g., stand per plot), and, therefore, the 
by formula (XVI-16). 


In some experiments the value 
but not on each individual item wi 
(X dev) will necessarily be that given 
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between two adjusted treatment means is obtained from formula (XVI-10) 
with appropriate substitutions. 

The treatment sum of squares in the latin square design may be par- 
titioned into its component parts and treated in the manner described in 
section XVI-2. If a group of individuals or samples constitutes the experi- 
mental unit, the problem of which regression to use is treated in the same 


TABLE XVI-4. Analysis of covariance for a k X k latin square design 


Treatment adjusted for average error regression 


manner as described in the previous section. Also, heterogeneity of regression 
coefficients or of residual variances and treatment differences in the covariate 
cause the same problems in the latin square design as in other designs. 


Example XVI-1. In the production of double cross hybrid corn an advantageous 
procedure is to use a single cross which produces no viable pollen (male sterile single 
cross). The detasseling of the female line is eliminated. Before a male sterile single 
cross can be substituted for its nonsterile counterpart, it is necessary to ascertain 
whether or not yields from the resulting double cross hybrids differ. Н. L. Everett 
conducted a number of these yield tests comparing the standard single cross with 
its male sterile counterpart. Theoretically, the sterile and nonsterile single crosses 
should differ only in the characteristic of sterility. 


The yield and stand data presented in table XVI- i ieti l 
crosses) related in the following manner: ML T ee 


Other single 
cross in the double 
cross combination 


: Nonsterile 
single cross single cross 
zi =I 


A=1xI F-1xI 
C=2xII 
D-3xII 


1Cornell Univ., unpublished results, 
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from the additional work, since the unadjusted and adjusted means are approximately 
equal. The procedure followed in covariance analysis should be to decide upon the 


covariate prior to studying the d 
variate to use, the resulting tests of significance and the use of tabulated probabilities 


are invalidated. 


TABLE XVI-5. Yields (Ibs./plot) of ear corn (Y) and number of plants (X) of 6 corn 
varieties (letters) in a 6 X 6 latin square design (systematic order of 


design given below) 


о 


m 
сос 


The sums of squares for both the variate Y and the covariate X are computed in 
the usual manner for a latin square design. The various sums of products of the variate 


and covariate are obtained as follows: 


Tolal sum of xy products: 


^ 615(256.5 Д 
18(8.6) + 16(7.5) + +++ + 15(74) — 82502565) = 10175. 


6 
Row sum of xy products: 


98(42.2) + — + 106(13.2) 9136565) = 4.708. 
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Column sum of xy products: 


100(44.2) + +++ + 95(40.7) =, 615(256.5) _ 3.358. 
6 36 a 
Variety sum of xy products: 
102(42.5) + +++ + 112(31.1) 615(256.5) —95.208 
6 36 Asse 


Error sum of xy products: 
—10.175 — (4.708 + 3.358 — 25.208) = 6.967. 
It is instructive to compute the error sum of zy products directly from the deviations 
(Xijy — ži.. — Ey. — 2... +22) and (Yin — di. — J.j- — J--n + 29) [273, ex. 12.13]. 
Since the treatment design was selected for specific contrasts, the treatment sum 
of squares was partitioned into three parts: S — steriles vs nonsteriles, P — compari- 
son among three pairs of crosses, and S X; P — interaction of pair with type of single 


cross. The sums of the zy products for these comparisons are obtained as follows from 
the totals in table XVI-6. 


TABLE XVI-6. Unadjusted and adjusted means for 6 corn varieties 


Variety Los. of corn (Y) Stand (X) Adjustment for Adjusted 
Total Meen Total Mean nean [~b(1,-x)] mean 


12.5 08 7.09 

Ў .22 8.30 

7.09 

1.66 

1.60 

5.02 

AF 1 17.85 -0.077 6.05 
B+C 91.8 .6 200 16.67 0.042 7.69 
E+D 91.1 .59 201 16.75 0.034 7.62 


ax 
mno 
ов 


Sum of xy products for S: 
187.2301) + 119.3(314)  615(2565) _ (137.2 — 119.3)(301 — 314) 
18 36 36 та 


6.464. 


Sum of xy products for Р: 


214(73.6) + 200(91.8) + 201(91.1 
2 (91.1) _ 91502565) _ _ 13.417. 


Sum of xy products for S X P: 
—25.208 — (—6.464 — 13.417) = —5.327. 


The adjusted sums of squares for error and for error + S are 


28.939 — (6.067,67 667 = 28.222 
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and (—6.464 + 6.967)? 


(8.900 + 28.939) — 1691-4 671667 ^ 37.839 — .003 = 37.836. 


The sum of squares for S adjusted for the average error regression is equal to 37.836 
— 28.222 — 9.614. The other adjusted sums of squares are similarly obtained. 

An alternative partilioning of the treatment sum of squares is set forth in the 
bottom part of table XVI-7. These comparisons among the treatments are the ones of 
importance if the interest lies in the difference between members of a pair. The sums 
of products are obtained in the manner described above. 

The F tests for the comparison of thé mean squares for S, P, and S X P with 


the error mean square are 


СЛОЕ 1 
F = Тв = OAT > Р, 19df) = 5.92, 
p-999 — 6.03 > Fu (2, 19df) = 5.93 
= 1495) «^ е5 Ж Үч 
апа 
236 
pu 3.236 = 218 < Ро (2, 19df) = 2.61. 


Thus, if the 5 per cent level of significance (per line in the analysis) were being used, 
the hypothesis of no difference among the pairs of crosses and between the double 
cross containing sterile and nonsterile single crosses would be rejected. The evidence 
of interaction is slight. From the alternative covariance analysis, it is noted that most 
of the difference in the S comparison is derived from the single contrast A vs F. 

The adjusted treatment means are obtained from formula (XVI-9) and are pre- 
sented in table XVI-6. The standard error of a difference between two adjusted treat- 
ment means (e.g., А vs F) is computed from formula (XVI-10) and is equal to 


„(1 1 (17.00 — 18.67)? у _ 
1.485 {2 + 6 es CU = 0.746. 


The coefficient of variation is equal to 
/1.485/1.125 = 17 per cent. 


It was assumed that stand was not a characteristic of the double crosses used. 
This assumption does not appear to be violated when the variety mean square for 
stand is compared with its error; thus, Ё = 4(19.917)/67.667 = 1.2. However, when 
the treatment regression, —25.208/19.917 = — 1.266, is compared with the error re- 
gression, 6.967/67.667 = 0.103, it is found that these two regressions differ. The sum of 

$ (—25.208)? , (6.967)? (—25.208 + 6.967)? 
squares for this contrast is equal to 19.917 67.667 ^ 19.917 67.667 
= 31.905 + 0.717 — 3.799 = 28.823, with one degree of freedom, and F = 28.823/1.485 
= 194 > Fo (1, 19df).1 For these crosses, poorer stand and higher yield charac- 
teristics appear to be associated together (see table XVI-6), but for a given double 
cross the relationship of yield with stand is positive, though not significantly so. 
Also, the negative linear relationship between yield and stand for variety means 
accounts for most of the variely sum of squares; thus, the residual sum of squares is 
32.413 — (— 25.208)2/19.917 = 0.508, with 4 degrees of freedom. The difference in 


1This assumes that treatment and error regressions have the same variance. 


TABLE XVI-7. Analysis of covariance 


Source of variation 


Sum of products 
e xy x 


-10.175 


Error +5 
Error + P 
Error + (S x P) 


Adjusted results 


19 28.222 1.485 
20 51.856 
21 46.126 


Error 
Error + S 
Error + P 
Error + (S x P) 


5 
P 


SxP 


Source of variation 


Among progenies (P) 
Within progenies 


Avs F 
Bvs C 
E vs D 


Error 


Misi wenn 
Error . 
6 


Error + P 

Error + (A vs F) 
Error + (Вуз С) 
Error + (E vs p) 


P adjusted for average regression 
A vs F adjusted for average regression 
Вуз С " " " " 


E vs D i " x " 
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yield between the two groups involving sterile and nonsterile single crosses may be 
due to a change in the stand characteristic. Evidently, stand is a varietal characteristic, 
and any use that is made of the results should be made with this in mind [273, sec. 12.5]. 


XVI-5 Split Plot Design 


The linear model for a split plot design with a covariate may be one of 
the following two forms, depending upon the assumptions made by the 
experimenter: 

Үз» = n + pj + ri + б; аһ + (ar)in + Bii. — z) 
+ B(X ijn — y.) + єз; (XVI-21) 


$3 Үз = p + pit ri + б; + аһ + (a7)in + В(Х у, — z) + єгүһ. (XVI-22) 


In the first equation, it is assumed that the regression, 81, for whole plot 
deviations is different from the regression, 62 = 8, for split plot deviations. 
In equation (XVI-22), it is assumed that there is only one regression, £. 
The key-out for the degrees of freedom and for the sums of products in the 
analysis of covariance, under the assumption that equation (XVI-21) is the 
appropriate linear model, is given in table XVI-8. If equation (XVI-22) is 
deemed to be the appropriate linear model for the covariance analysis, the key- 
out for the degrees of freedom and for the sums of prodacts is as presented in 
table XVI-9. 


TABLE XVI-8. Covariance analysis for a split plot design* 


Sum of products Adjusted ss 
Source of variation 
T 
Replicate R, be, = - 


R 
у - - 


Variety 


DO SIS 
уу Ауу Ау ocx 


ай) te - г 
r(v-1)-i Wy А М 
у-1 VOU WIS ALT 

MEI ORE LUN 


Error (a) 


Treatment 


Treatment x variety 


Error (b) 


Treatment + error 


Trt. x var. + error 


Treatment adjusted for split plot regression 


Treatment x variety adj. for split plot regression 


"In the above, the varieties represent the whole plot treatments and the treatments 
represent the split plot treatments. 


496 Созагїапсе [$ XVI-5 


TABLE ХУІ-9. Alternative covariance analysis for the whole plots in a split plot design? 


Sum of products Adjusted ss 


Source of variation 


Replicate 
Variety 
Error (a) 


Error (b) 2 ке 
yy 


2 
5 f y By 
Variety + error (b) fytv-1 Р Р RS. tpv- 26 я 
E ' 
Error (a) + error (b) Erf. Sy; 95 qx fol E 
vel уу 


Variety adjusted for split plot regression 


Error (a) adjusted for split plot régression 


"The remainder of the covariance analysis is that given in the bottom part of table X VI-8. 


If no knowledge is available as to which equation ((X VI-21) or (XVI-22)) 
is more appropriate, the results from several similar experiments may be 
summarized, and the regression coefficients may be tested for homogeneity 
[273, Ch. 12]. Then, on the basis of these results the experimenter may be 
able to decide which model is more suitable for his purposes. However, for 
most field experiments it is suggested that equation (XVI-22) be used [260]. 
One further consideration in favor of using the error (b) regression to adjust 
sums of squares and means is the small number of degrees of freedom usually 
associated with the error (a) sums of products. However, this procedure en- 
counters difficulties in making tests of significance. Н. F. Smith (written 
correspondence) has shown that V,,'f./Ay,/(v — 1) in table XVI-9 is not 
distributed as Snedecor's F and has suggested that the ratio of the mean 
squares adjusted by formula (XVI-12), say Vafz/Aa(v — 1), be used [also, 86 
14, p. 145]. Further work is required in order to clarify tests of significance 
when the analysis is of the form given in table XVI-9. ^ 

f The adjusted means are obtained from formula (XVI-9), using the appro" 
priate regression coefficient. The variance of an adjusted ions is given by 
equation (XVI-10). The comments relative to homogeneity of regression СО- 
efficients and of residual variances apply here as well as in preceding sections- 

' In asplit split plot design, there is the possibility of one, two, or three regres- 
sion coefficients, depending upon the assumptions made by the experimenter 
Likewise, in designs with additional splitting of the plots, there is the possi- 
pud А а two, 177 Or more regression coefficients, The analytical proce 
von i ec are determined along the lines discussed for the spli 
^ à wd d BH (6) and error (c) sums of products usually will b^ 
ssociated with sufficient degrees of freedom to permit relatively accurate 
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determinations of the regression coefficients, insofar as the accuracy of these 
regressions depends upon the number of degrees of freedom. 


XVI-6 Split Block Design 


The problem of determining the appropriate regression model arises in 
the split block design in the same manner as in the split plot design. If the 
whole plot treatments are subject to different regressions, the linear model 


for a split block design with a covariate is 


Yan = p pj + ri t Os + «+ Yia + (ar)in + eus + Big. — 2) 
+ 8з(2;.„ — 2) + Bs(Xijn — Tije — Tjon + 2). (XVI-23) 


If there is only one regression, the linear model is 


Yisn = et ppt Ti + бу on Y (ar)in+ eua + B(Xijn — 2). (XVI-24) 


In the above equations, y — mean effect, p; = effect of jth replicate, т; = ef- 


fect of whole plot treatments in one direction, « = effect of whole plot 


treatments in the other direction, (ar)in = effect of ith treatment in one set 


on Ath treatment in the second set, 5:; = random component of error variation 
associated with ijth plot, yi, = random component of error variation asso- 
ciated with jhth plot, ej 4 = random component of error variation associated 
with ijhth plot, 8: = regression coefficient associated with residual variations 
of Y on X for the first set of whole plot treatments, 8: = regression coefficient 


associated with the second set of whole plot treatments, and b; = В = re- 


gression coefficient associated with residual variations of Y on X within 


whole plots. 
If equation (XVI-23) is the appropriate linear model, the key-out for the 


degrees of freedom and for the sums of products in a covariance analysis of a 
split block design (let treatment = whole plot treatments in one direction 
and let variety = whole plot treatments in the other direction) is given in 
table XVI-10. If equation (XVI-24) is the appropriate linear model, then the 
covariance analysis is similar in form to the second key-out for the split plot 
design (table XVI-9). The sums of products С, Czy, and Cz are added to 
the comparable sums of products in the other lines in the analysis of co- 
variance; the treatment means are adjusted by formula (XVI-9), using the 
regression coefficient b — С.С. 

The adjustment of means and the computation of standard errors under 
the two linear models proceed in the same manner as discussed in the pre- 
vious section. More complex designs of the split block nature give rise to 
more regression coefficients and to a linear model of the form given by equa- 
tion (XVI-23). If a single regression coefficient suffices, as in equation (XVI- 
24), the procedure follows that outlined above. 

f freedom associated with the T X replicate 


The number of degrees o А 8 
and the V X replicate mean squares is usually relatively small. Àn equation 
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TABLE XVI-10. Covariance analysis for a split block design 


Sum of products Adjusted ss 


Source of variation 


Replicate 
Treatment 
Trt. x replicate 


Trt. + (trt. x rep.) 


Variety 
Variety x replicate 


Var. * (var. x rep.) 


Treatment x var. 
Rep. x trt. x ver. 


Trt. x var. + 
(rep. x trt. x ver.) 


Trt. x var. adjusted for within whole plot regression 


similar to (XVI-24) will be appropriate for many field experiments. There- 
fore, it is recommended that equation (X VI-24), or a similar one, be used as 
the linear model for a split block design unless it is known that equation 


(XVI-23) is the appropriate linear model (see preceding section regarding 
tests of significance). 


XVI-7 One-Restrictional Lattice Designs 


Covariance analyses in lattice desi 
Some of these complications are enc 
far, while others represent new 
covariance analysis of a lattice 
is to be recovered. Cornish [ 
analysis of covariance witho 


gns are complicated by several factors. 
ountered in the designs discussed thus 
problems. The first question that arises in the 
design is whether or not interblock information 
66] and others [183, 252] have discussed the 


AE. ation the next question that arises is the one 
pertaining to the appropriate 


Cochran [48] states that the following thr icati i i 4 
a e : he ех 
pectation of the block (elimin 4 RACE MUR 


intrablock error regression is 
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degrees of freedom are not orthogonal with the degrees of freedom for regres- 
sion (the correct coefficient is slightly smaller than (r — 1)k/r because of the 
non-orthogonality; see section XVI-12.3), (ii) the variance of a difference 
between two adjusted treatment means is not of the form given by equation 
(XVI-10) but involves the block and intrablock mean squares for both the 
variate and the covariate, and (iii) the use of the intrablock error regression 
coefficient introduces a correlation between the pseudo-effect estimated from 
the replicates in which the effect is unconfounded with incomplete block 
differences (intrablock estimate) and the same pseudo-effect estimated from 
the replicates in which the effect is confounded with incomplete block dif- 
ferences (interblock estimate). The use of separate regression coefficients for 
interblock estimates and for intrablock estimates eliminates the difficulties 
mentioned in (i) and (iii) above. However, the calculation of the adjusted 
means becomes more difficult, and the difficulty of obtaining the correct 
standard error of a difference between two adjusted means still remains [48]. 


XVI-7.1 COVARIANCE ANALYSIS WITHOUT RECOVERY OF INTERBLOCK 
INFORMATION 

Cornish [66] has discussed covariance analyses for the balanced incom- 
plete block design (section XIII-2.1), the double lattice design (section 
ХІ-3.1), the triple lattice design (section XI-3.2), and the cubic lattice design 
(section XI-4.1). The form of the analysis of covariance without recovery of 
interblock information for the balanced incomplete block design with the 
treatments not in compact replicates is given in the top part of table XVI-11. 
The form for covariance analysis for the other lattice designs is that given in 
the top part of table XVI-12. Covariance analyses for other one-restrictional 
lattice designs may be obtained by a simple extension of the results given by 
Cornish [66] and others [183, 252]. 

The sums of squares for treatments (eliminating block) for both the 
variate Y and the covariate X are obtained in the manner described in 
section XIII-2. The sum of the zy products for the treatment (eliminating 
block) effects in a balanced lattice design is equal to 


2:0;0)0;() (ЕЮ), (XVI-25) 


1 
the variate Y, Q;(z) = Q; value for the covariate 
defined in formula (XIII-8). The sum of products 
designs is of a slightly more complex form. The 
inating block) effects in a double 


where Q,(y) = Q; value for 
X, and the other symbols are 
for partially balanced lattice i 
sum of the zy products for treatment (elim 
lattice design is 

DX du — DX Dg’ /k — 32Xs42 s /Ё, (XVI-26) 
vy i i j i 

where i 


ju = 99 + XO — Yay Ys. — Yu). 
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The value of 7;;' (formula (X1-120)) is the adjusted treatment mean ignoring 
interblock information. The corresponding sums of the zy products for treat- 
ments (eliminating block effects) in a triple lattice design and in a cubic 
lattice design are, respectively, 


gy — i 2Js/k— УХ. УЬ Хз. 2 dul /k 
Didi xX у x 2-7 Dai / >; з У; (ХҮГ.2Т) 
(u = i + njandv = a corresponding function ofiandj,sayv = i+ (n — 1)j 
for n = 2,3, ..., ork — 1) and 


2 Х.уһўуь — уэ, дун Dogan’ /k — iX 2k E Ух. уэ 
Л 4 (XVI-28) 


The quantities 7; and V; are the intrablock estimates of the ireatment 
means. The sum of the zy products for treatments (eliminating block) in 
other partially balanced lattice designs may be obtained from the formulae 
given by Rao [252, sec, 7]. 

The procedure for obtaining the treatment (eliminating block) sum of 
Squares adjusted for the intrablock Tegression is given in the top part of 
tables XVI-11 and XVI-12. Tests of significance, adjustment of treatment 


; and computation of variances of treatment 


TABLE XVI-11, Covariance analysis for one-restrictional lattices with the treatments 
not arranged in complete replicates 


Source of variation Adjusted ss 


ss 
Block (ign. treatment) = 
Treatment (elim. block) 


Intrablock error 


2 
Ya. -E JE. 
5 E ts-E. y “ху xx 
Treatment + error 


2 
ys W чу [LA 


Treatment (ign. block) 
Block (elim. treatment) 


Intrablock error 


Block * error 
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TABLE XVI-12. Covariance analysis for one-restrictional lattices with the treatments 
arranged in complete replicates 


Source of variation 


Replicate 
Block (ign. treatment) 
Treatment (elim. block) 


Intrablock error 


£ nel 
t 2 


Vote Wy tome RO 
Уу "yy 7 буу 


Treatment (ign. block) - Е 
Block (elim. treatment b-r-l в +в в ё, 
оск (elim ment) BS By By з. 


Bl + U, t у ?/u 
ock + error Uy "y Uy / hs 


Block im. treatment) adj. for intrablock regressio! В tat t*-mg t 
ock (е1: nt) adj al Ег п hs i xs 


XVI-7.2 COVARIANCE ANALYSIS WITH RECOVERY OF INTERBLOCK IN- 
FORMATION 

Covariance analyses in lattice designs with recovery of interblock informa- 
tion pose several problems. These are mentioned in the introduction to this 
section [48]. The first problem is to determine the appropriate regression 
coefficient to use in adjusting the treatment means for covariation in X. The 
two regression coefficients obtained from an analysis of an experiment de- 
signed as a one-restrictional lattice are 


b* = B.,/Bzz (XVI-29) 
b = E,,/Ez (XVI-30) 


and 
where the sums of products are defined in tables XVI-11 to XVI-12. If b* 
is approximately equal to b, Cochran [48] and Robinson and Watson [259] 
suggest that b be used in formula (XVI-9) to adjust the treatment means. 
If b* and b are not approximately of the same magnitude, they suggest the 
use of a weighted regression coefficient but do not state the form that the 
weighting should take. Presumably it should be of the following form for the 
ijth treatment in a double lattice design with two replicates (see formula 


(XI-9): 
{ КОЗҮ ke. ООС 
#ч(ай)) = Fy — Иж = au + Хы — та 3 ( k ыш я 2), 
(XVI-31) 
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where ў;; = unadjusted mean of the ijth treatment, Xu — yield of ird 
for the ijth treatment in the gth replicate, the i subscript refers to ge (d : 
of the pseudo-factor a confounded in arrangement 1, the j subscript re T 
the levels of the pseudo-factor b confounded in arrangement 2, and the 
regression coefficients are defined in equations (XVI-29) and EUT 
Formula (XVI-31) reduces to formula (XVI-9) when b = b* and when the 
block means are equal to 2. Also, formula (XVI-31) may be extended for more 
replicates of a double lattice design and for other one-restrictional lattice de- 
signs. Thus, for n groups (arrangements) of a one-restrictional lattice design 
with q repetitions of the groups the formula for adjusting the treatment means 
for variation in the covariate using a weighted regression is 


: |. b Xii.. Ais 
9ii(adj) = ў; — Ge = == ToS — жы) 


= 28 — 4+... 4 Anu _ au). (XVI-32) 


where treatment ij falls in the uth block of the 
to the summation over the q repetitions of 
are defined in equation (XVI-31). 
After the treatment means, 7;;, 
step is to obtain the weights for the 


nth group, the last dot refers 
à group, and the other symbols 


let u be the estimate of intrablock 
and let u' be the estimate of interblock informa- 
mate of interblock information from a one-restric- 
= n arrangements is A 


u= (f, — 1)/Е,; (Күр 
the estimate of interblock information is either 


0 


M г 1 XVI-34) 
TB DIESES =a; er 


or 


(ema ) 
» VI-35 
^ TB, /f, pe Ef. P 1) T 


to account for the non-orthogonality ; i п regres- 
Sion (section XVI-12,3) 3 7 iretued by using а commo 


he weights or adjustments of incomplete block effects for the treatment 
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totals from a double lattice design adjusted for the covariate by formula 
(XVI-9) are 

c! = MCA) — 2004, — Ы(А).: — 2001: (XV1-36) 
and 


c, = u([0).; — 2(Y),], — 6[(B).; – 2(Y)]-], — (XVI-37) 


where и is defined by equation (XI-34) using the weights obtained from 
formulae (XVI-33) and (XVI-35), the quantities within the square brackets 
are defined in section XI-3.1, example XI-2, the y subscript in the right side 
of the equations refers to the variate Y, the z subscript in the right side of 
the equations refers to the covariate X, and b is defined by equation (XVI-30). 
The subscripts on the left-hand side of equations (XVI-36) and (XVI-37) 
refer to the two groupings in the double lattice design. 

If the two regressions b and b* were used, the last part of formulae (XVI- 
36) and (XVI-37) would have to be extended to a form similar to that obtained 
in formula (XVI-31). 

The extension of formulae (XVI-36) and (XVI-37) to the triple lattice 
and to the rectangular lattice is straightforward and will not be given here, 
since they appear in the literature [48, 259]. The weights for the treatment 
totals adjusted for the covariate in the cubic lattice design are given below. 


&/ = Ij, — bj) + ETT, + Му — ЫШ. + MZ), (XVI-38) 


oy! = ftm, — bm.) + M M, + Ly! — b(M: + L;)], (XVI-39) 


and 


of = I, — Ы) + EMAL, + Jy — Мы 7) VIO) 
where the symbols on the right hand side of the equations are defined in 
section XI-4.1, example XI-6, method of analysis (i), the weights и and X 
are computed from the adjusted sums of squares using formulae (XI-63) and 
(ХІ-64), and the subscripts оп the right-hand sides of the equations refer to 
the covariate and variate, respectively. The subscripts on the left-hand sides 


of the equations refer to the X, Y, and Z groupings in the cubic lattice design. 


The value of b is equal to Ez,/ Ez, where the latter values are defined in table 


XVI-12. t : s 
The analysis of the one-restrictional lattice design now proceeds in the 


manner described in Chapters XI and XIII using the above weights u and 
u' in place of ш and w’, respectively. The standard errors obtained from the 
formulae given in Chapters XI and XIII are only approximate, since they do 
not contain the sampling variance for the estimated regression coefficients. 
This additional component of variance usually will be small in the larger 


lattice designs [48]. 
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XVI-8 Two- and Higher-Restrictional Lattice Designs 
XVI-8.1 TWO-RESTRICTIONAL LATTICE DESIGNS 


Covariance analyses for the two-restrictional lattice designs, oe ae 
lattice square, have not been described in the literature. Cornis ү ar bg 
Bliss and Dearborn [25] present the covariance analysis, illustrate od 
example, for a semi-balanced lattice square design without recovery o 


• е 
With the recovery of interblock information, complications of ‘the oe 
discussed in the previous section arise in the covariance analysis for aha na 
ment designed as a lattice square. If a common regression is used, sonal tae 
block error regression, the same problems arise as for the p 
tice designs, If a weighted regression is used to adjust the treatmen 
three regression coefficients are necessary; thus: 


b = E,JE, а т. 
с = Uzy/Uz2, 

and NA (XVI-43) 
PSS zy, тту 


-14. Тһе 
where the sums of Products are defined in tables XVI-13 and XVI-14 
adjusted trea 


7 а iven 
eatment mean using a weighted regression similar to that g 
in equation (XVI-30) for the two replicates of a lattice square 1s 


: i , АВ)» + (ABP) 
(аа) = gy — Д, - 0 2i 2u Tw ee 


(А), + (АВ), ғ (В); + (AB), _ =), (XVI-44) 
- „(0+ (AB), d Э) Б p (But (m. 2 


XII-1. The А and AB pseudo-effects are confounded with rows and t 
and AB? pseudo-effects аге confounded with columns. Formula (X 


mn 
reduces to formula (XVI-9) when b = b- = b. and when the row and colu 
means are equal to z. T 


А iatel 
he extension to more replicates follows immediately 
from the Tesults in the Previous section, the 
After the treatment means are adjusted to a common covariate ег. 
Next step is to obtain the weights for rows and columns in order to adJ li- 
the treatment means for row and column effects. The procedure for two rep 


P i ign- 
cates of the lattice Square design follows that for the double lattice AD 
If a single regression coefficient is used, the same difficulties encountere 
the double lattice des 


cals pee infor- 
ign arise in the estimation of the row and column 


* 


TABLE XVI-13. Covariance analysis for a semi-balanced lattice square design 


Sum of products 


Adjusted ss 


Source of variation 


Replicate 


Treatment (ign. row 
and column) 


Row (elim. treatment) 


Column (elim. treatment) 


Intrablock error 


Row + error® 


Column + error? 


Row (elim. trt.) adjusted for intrablock regression 


Column (elim. trt.) adjusted for intrablock regression 


"Luv = Bay + Euv; May = Cuy + Ew; u, v = x, y. 


TABLE XVI-14. Covariance analysis for a balanced lattice square design 


Source of variation 
EI 
Replicate 


Treatment (ign. row and 
column) 

Row (elim. treatment; 
ign. column) 


Column (elim. treatment 
and row) 


Row (elim. treatment 
and column) 


Column (elim. treatment; 
ign. row) 


Intrablock error 


Row + error? 


Column + error® 


Sum of products 


Adjusted ss 


Row (elim. trt. and col.) adj. for intrablock regression 


Column (elim. trt. and row) adj. for intrablock regression 


у = х, y. 


“Lav = Ruy + Euv; Muy = Cav + Euv; 
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g ry Uc. variance analysis 
mation. The approximate weights Ur, Uc, and u from the covariz Nahe. 
lon. i : 
sponding to the estimated weights Wr, We, and ш, from the variance 
соггез 
analysis are 


r-1 (XVI-45) 
ur rRyy/f. = Ey / (fe = 1)’ 

r-]1 (XVI-46) 
He rCyy'/fe F; Еу, /(f. > 1) 

and u= (f, 0/Е,, Сини 


where the sums of squares are defined in tables XVI-13 and DM (XIL3) 
The weighting factors for rows and columns are given in formu em 

and (XII-4) using the quantities u,, ue, and u for the нчен ы... 

respectively. The following four adjustments are added to Ea | зап 

totals adjusted for covariation їп X to obtain the treatment to а г 

for covariation in the X variable and for row and column effects: 


CVI-48) 
e = MIO = (А), — B). — 2(4)],), ae 49) 
Ge’ = МАВ). — 2(4B);], — b[(AB).g — %(АВ),,].), a 
си = #{[(B).; — 208), — b((B)., — 2(В),/).), rae 


ae oe! Г (Ир, — 2(AB?,,], — b[(AB?)., — 2(AB%oa]e}, (XVI-5D) 


. -36 , 
where the quantities in the above equations are defined in formulae (XVI-36) 
(XVI-37), (XVI-41), and in example XII-1. 


The remainder of 
for a variance analys 
they do not contain 


the covariance analysis proceeds in the same mnn 
is. The resulting standard errors are approximate, KL 
а component for the sampling variance of the E UM 
coefficient. The comments on the covariance analysis given in the pre 
sections hold in this section ag well. ith more 
The extension of the above results to lattice square designs with d 
replicates and to other two-restrictional lattice designs is straightforward. 
XVI-8.2 LATTICE DESIGNS WITH MORE THAN TWO RESTRICTIONS 
Covariance anal. 
the allocation of th 
nature as those de: 


а n 
yses for lattice designs with more than two rostriction да 
€ treatments within a complete replicate are of the Er. 
Scribed in sections XVI-7.1, XVI-7.2, and XVI-8.1. just- 
number of regression coefficients increases, and the computations and ad 
ments become more complex, If Possible, proper stratification of the buo 
mental materia] into homogeneous subgroups of the covariate X elimin: ns 
the necessity of а Covariance analysis. Ag the nature of the computatio 


у /ses 
becomes more complex and as the assumptions relating to covariance analy 
become harder to Satis: 


jation 
fy, the more necessary it becomes to control сохапа 
of X and Y by proper Stratification, 


t 
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XVI-9 Change-Over Designs 


Fieller [117] describes a covariance analysis for the simple change-over 
design (see section XIV-2.1). Covariance analyses for the other designs in 
Chapter XIV have not been discussed. The analysis of covariance for the 
simple change-over and for the double change-over designs are discussed 
here (tables XVI-15 and XVI-16). The analytical procedure for other designs 
may be obtained from the results presented below. $ 

The linear model for- the yield of the ijhth observation (тот. а simple 
change-over design with a covariate is 


Үз = рі pty trat В(Х;„— £) + en (XVI-52) 


where и = mean effect, р; = ith row effect, y; = jth column effect, ra = hth 
treatment effect, 8 = regression coefficient for Y deviations on X deviations, 
and ¢;;, = random component of error. The key-out for the degrees of freedom 
and for the sums of products is given in table XVI-15. The adjusted treatment 
means are computed from formula (XVI-9), and the variance of a difference 
between two adjusted means is obtained from formula (XVI-10). 

The covariance analysis for a double change-over design is somewhat 
more complex. The linear model for the ijhth observation from a double 
change-over design with a covariate is (for q — 1) 

Yi = рф о; + 8;4- (a8); + 6,+ pet В(Х;;› Ex. 2) + Eijhy (XVI-53) 
where the effects are defined in equation (XIV-11) and £ is the regression of 
the Y deviations on the X deviations in the error line. The key-out for the 
degrees of freedom and for the sums of products in the covariance analysis 
of a double change-over design is presented in table XVI-16. The sums of 
the zy products for the treatment effects in the analysis of covariance are 
given below. The formulae are for three treatments and for one set, but may 
easily be extended for more treatments and for more sets. 


TABLE XVI-15. Covariance analysis for a simple cross-over design with 2 treatments 


Source of variation 


Column 
Row 
Treatment 


Error 


"Suv = Tay + Ew; u, V = X, Y- 
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TABLE XVI-16. Covariance analysis for q repetitions of a double change-over design with 
3 treatments 


Adjusted ss 


Group 
Period 


2q-1 
2 
2(2q-1) 
liq 


Period x group 

Within group 

Direct (ign. residual) 
Residual (elim. direct) 


Direct (elim. residual) 


Residual (ign. direct) 


Residual (elim. direct) adjusted for error regression 


Direct (elim. residual) adjusted for error regression 


By Ey Ex 
Ho. Uoc || аз 
Sy Зуу Sa О; 


"To, = Ви, + Euv; Suv = Duy + Ey; uv = х, у. 


Direct effects eliminating residual effects (see formula (XI V-4)): 
Dey = [5X..4 + 2a — b — € 82+ 5 — 2X.../3][5Y..4 +20 — b-—c+s 
+ se — 2Y.../3]/120 + [5X..5 4- 2b — a — c 4 s, T — 2Х.../3][5Ү..в 
+26 — a e+: 9 4+ 5, — 2Y.../3]/120 + [5X..¢ + 2 — a — b+ sit ss 
— 2X.../3][5 Y..c+2c—aq— b+s+5,-— 2Y.../3]/120 
(4X...) (AY... 
- SAO, (XVI-54) 
Residual effects ignoring direct effects (see formula (XIV-5)): 
Folla + s; + s, [3a + s, 4 Sely + 


[3b + s; + s], [8b + зз + 94], 
+ [Be + s1 + se]-[3e + sı + $ь],} 


ВЕРО х јав. с) + Y...]/90. (XVI-55) 


Direct effects ignoring residual effects (see formula (XIV-3)): 


3 
EXAY..— X.Y. ng. (XVI-56) 
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Residual effects eliminating direct effects (see formula (XIV-6)): 
Rey = Alit X. + 52+ aar У.а +9 86] 

+ [Bb + Х.в + ss + s:J[3b + Y..8 + s3 + 8] 

+ [3с + X..c + $1 T Ss] [3c + Y..c d 51 + 5]} 

— [3(a +b -- c) + 2Х...][3(а +b + c) + 2Y...]/72. (XVI-57) 
The quantities within the square brackets are computed from the data for 
the variate or for the covariate, depending upon whether Y or X appears in 
а bracket or as a subscript. The sums of products for the error line are obtained 
by subtraction. The errors of estimate are obtained as described in table 
XVI-16. * 

The treatment means adjusted for residual effects are obtained from an 

extension of formula (XVI-9). For a given treatment, say А, from a double 


change-over design with three treatments, the mean adjusted for residual 
effects (see formula (XIV-12)) and for the covariate is equal to 


Ga’ = [5Ү..А + 2a — b — c + s: + ss — 2Y .../3]/24 


- ut (Biat anb otat 2X9] 4 
(XVI-58) 


where the various quantities are defined above. The residual effect for treat- 
ment A adjusted for the direct effect of treatment A and for the covariate 
is obtained from the formula (see formula (XIV-19)), 


[3u + са $82 + $e] 26 ie {[3а 4 Koy E $2 + 56] 
— [(a +b + c) + 2X.../3]]  (XVI-59) 


The formulae for obtaining adjusted means for treatments B and C and for 
double change-over designs with more than three treatments are constructed 


in an analogous manner. 


XVI-10 Covariance and Unequal Numbers of 
Observations per Treatment 


XVI-10.1 COVARIANCE ON DUMMY VARIATES FOR MISSING OR MIXED-UP 
PLOT YIELDS 

Bartlett [14] has described a method for calculating the analysis of vari- 
ance for an experiment with missing data. A covariate (sometimes called a 
dummy covariate) consisting of zeros associated with all the yields obtained 
in the experiment and a one (a minus one may be more advantageous com- 
putationally) for the yield of the missing observation is used. A zero is in- 
Serted for the missing yield, and then an analysis of covariance is calculated. 
The error mean squares obtained in the above analysis of covariance and in 
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the analysis of variance after computing the yield for the missing datum are 
identical. The estimated yield for the missing observation is 


Yo — bzo = —b = Е, Е, (XVI-60) 
where yo = 0, ж = 1, and the sums of products E., and E.. are obtained 
from the error line in the analysis of covariance, The sums of squares for the 
covariate are simply the degrees of freedom divided by the total number of 
observations in the experiment. 

If more than one observation is missing, a covariate is used for each missing 
datum, and a multiple covariance is computed [14]. Nair [216] applied 
Bartlett’s method to the randomized complete block and to the latin square 
designs with several missing observations and with mixed-up observations. 
Smith [269] discusses the use of dummy covariates in a latin square design 
and presents some basic viewpoints on the use of covariance for such purposes. 

The usefulness of Bartlett’s covariance analysis with missing yields, the 


relationship of the method to the analysis of variance with estimated yields 
for missing data, and the use of the method in the analysis of variance with 
unequal numbers are illustrated in the f. 


ollowing example. 
Ezample XVI-2. A set of artificial data was constructed for ease of computation. 
The design is a randomized complete block with three treatments. The yield for treat- 


TABLE XVI-17. Data for 3 replicates of a randomized complete block design with 
i 3 treatments and one missing yield 


Replicate 
Treatment I II III 


TABLE XVI.18. Covariance analysis for data in table XVI-17 


Adjusted results 


Total 


Replicate 8/9 
Treatment ГЕ 
Еггог 1 /5 Dy, 


Error + replicate A 


7 

5 
Вер1: 5 
leplicate adjusted for covariate | e] 
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ment C in replicate III is missing. À zero is inserted for the missing yield, and the co- 
variate composed of zeros and a one is paired with the Y variate as shown in table 
XVI-17. The analysis of covariance for these data is given in table XVI-18. 
The estimated value for the missing observation obtained from formula (V-5) is 
equal to 
3(30) + 3(12) — 78 _ 1, 
(3 —1)(3 — 1) i 


This value may be obtained from formula (XVI-60); thus, — (—16/3)/4/9 = 12. The 
estimated value for the missing yield is inserted in the table of yields, and the analysis 
of variance is then computed (table XVI-19). The error sums of squares in table XVI- 
18 and XVI-19 are identical, as they should. be. As stated in Chapter V the sums of 
Squares for treatments and replicates are overestimated. The correct replicate and 
treatment sums of squares are given in table XVI-18, and these are somewhat smaller 
than the comparable ones given in table XVI-19. 


TABLE XVI-19. Analysis of variance for the 
variate Y after estimating 
the yield for the missing item 


Source of variation 
Total 


Replicate 
Treatment 
Error 


If the procedure for obtaining the correction for disproportion as described by 
Snedecor [273, Ch. 11] is followed, the correct sums of squares for treatments and for 
replicates are obtained. First compute the sum of squares for the effects weighted by 
the number of observations (table XVI-20). For example, the sum of squares for treat- 


ments is equal to 


15? 395, 30 78% _ уот 
eg ke 8 127.5. 


The residual sum of squares їз equal to 


155.5 — 43.5 — 127.5 = —15.5. 


The correction for disproportion is equal to 
error sum of squares — residual sum of squares — 10 — (—15.5) — 25.5. 

um of squares and subtracted from the repli- 

ble XVI-20 to obtain the correct sums of 

1 sum of squares in tables XVI-20 and 


The correction is added to the residual s 
Càte and treatment sums of squares in ta 
Squares (tables XVI-18 and XVI-21). The tota 
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XVI-21 is equal to the replicate sum of squares plus the within-replicate (treatment 
(eliminating replicate effect) and error sums of squares) sum of squares. 


TABLE XVI-20. Analysis of variance on yields of the variate 
weighted by number of observations 


Source of variation 


Total 


Replicate б 
Treatment 127.5 
Residual -15.5 (by subtraction) 


TABLE XVI-21. Sums of squares in table XVI-20 corrected for dis- 
proportion (correction for disproportion — 25.5) 


Source of variation 


Total 
Replicate 
Treatment 
Error 


155.5 = 43.5 + 109 + 10 
18 = 43.5 - 25.5 
12 = 127.5 ~ 25. 


2 
10 =-15.5+25.5 


XVI-10.2 COVARIANCE ON DUMMY VARIATES FOR NON-ORTHOGONAL 
TWO-WAY CLASSIFICATIONS 


Quenouille [251] has extended Bartlett's [14] results to the analysis of 
а 2X s table with unequal numbers in the 2 X s subclasses. He states 
that the method may be substituted for the ordinary method of analysis 
[273, Ch. 11] when the deviations from orthogonality are small. The short- 
cut method described by Snedecor [273, Ch. 11] is as simple as Quenouille's 
method, but the method presented by the latter author [251] is described 
here for completeness, 

Quenouille's [251] method of analysis for a non-orthogonal 2 X s classi- 
fication involves the use of a covariate composed of zeros and ones. In the 
classification composed of two categories a one is assigned to every individual 
11 one category and a zero is assigned to every individual in the second 
category. The resulting subclass totals and number of individuals per subclass 


are 
A B category 
cate- a o mec c NE Total 
gory Р 5. E rc e| b, 
9, X | No. an x 
FE —|— o. Y X No.| Y х9 Nes ТҮ 8 
& Ке x o. | | Yu. | me n, | Y n n Yi ni: 
- EOM LI |__| рен Үз 0 P Ya. 0" S Ys 0 
Total | n. | Үл. xm cre eee ct c MG : 
s 
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The analysis of variance for these data is presented in table XVI-22. In 
the analysis of variance for disproportionate numbers in a 2 X s table Que- 
nouille's method results in the same values (within rounding errors) obtained 
by the method described by Snedecor [273, Ch. 11]. 


TABLE XVI-22. Analysis for a 2 X s table with unequal numbers in the subclasses 
Covariance analysis 
Sum of products Adjusted ss 
Source of variation 
аг 92. xy ox af s 
T n-2 T 
xx 


5 
Total E T 
кр т ху уу уу xy жх 
В - в 
5-1 ky 
Residual n-s 


B (eliminating A effects) 


Source of variation 


B (eliminating A) 
A (eliminating B) 
Within subclasses 


Among subclasses 


A x B (eliminating A and B) 


XVI-10.3 COVARIANCE ANALYSIS OF n-WAY CLASSIFICATIONS WITH UN- 
EQUAL NUMBERS IN THE SUBCLASSES 


XVI-10.3.1 The randomized com 
of values for each experimental unit. 
classification with disproportionate numbe 
by a randomized complete block design wi 
missing observation may be for the variate Y o. 
given pair of values may be missing. | 

n a randomized complete 


If a value for the variate, зау Y;;, is missing i : 
block design, Bartlett [13] states that the corresponding value of the co- 


variate, say Х;;, should be discarded. The estimated values for Ү;; and for 
Xi, say f; and Xy, are computed using the ordinary formula (formula 
(V-5)) for estimating the value for a missing observation in a randomized 
complete block design. The number of degrees of freedom associated with 
Е... Ezy, and Е,, (table XVI-3) is equal to (r — 1)@ — 1)- 1. The analysis 
of covariance now proceeds in the usual manner except that Ey,’ is associated 
te as in sections XVI-10.1 and XVI-10.2. 


plete block design with one pair 

The simplest form of a two-way 
rs in the subclasses is represented 
th one missing observation. The 
г for the covariate A’, or the 


YThe covariate here is not a dummy varia 
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with (r — 1)(v — 1) — 2 instead of (r — 1)(v — 1) — 1 degrees of freedom. 
The adjusted mean is computed from formula (XVI-9), using only the actual 
values obtained for the Y variate and only the values of the Ж covariate 
which are associated with the Y values. The same procedure is followed if 
both members of a pair are missing. The reason for estimating both members 
of the pair of values for which the Y value is missing and for which the value 
of the covariate X is not missing is that there should be no contribution to the 
regression from the missing plot when the final yields, Y, are adjusted for 
covariation in X. The fact that the value of the covariate is available is of 
no use in obtaining an adjusted mean. Only those values of X which are asso- 
ciated with Y values are utilized in obtaining the adjusted treatment mean. 

If, however, the X and Y values are considered as coming from a bivariate 
distribution, the interest lies in the means, j and z, the variances, and the 
covariances for the two variates. In this case, there is a loss in information 
if one of the values is discarded simply because the other member of the pair 
was inadvertently lost. If the value of Y is available but the associated X 
value is missing, it appears that the above procedure [13] is not satisfactory, 
since it does not utilize all the data. This is especially true for the smaller 
experiments. А suggested procedure in this case is to estimate the missing 
value for the covariate by formula (V-5). This value is inserted in the table 
of yields and the analysis of covariance is computed in accordance with the 
procedure outlined in table XVI-3. The adjusted mean for the ith treatment 
is obtained using all r values of Y in computing j;. and the г — 1 X values 
associated with the Y values in computing z;.. The quantity Е, is associated 
with (r — 1)(v— 1) — 1 degrees of freedom; i.e., no reduction is made for 
estimating the value of X. However, this should be accounted for in computing 
the variance of the adjusted treatment mean. 

An alternative procedure would be to estimate the missing value for the 
covariate subject to the restriction that £,,’ should be a minimum. Fhis 
would necessitate the subtraction of one degree of freedom from E,,'. The 
value for the missing covariate, say Xu, is obtained from the formula, 

EL 1 


r—1 =] 
-— m аа 
(Yu =e = Ga + 9) > Be = Р. (XVI-61) 


gested, since it wa аы» їп Ё апа E.,, an iterative solution is sug- 

X. ЖОП үч is a possible to solve for Xi explicitly. A first guess for 
ne aine from equation (V-5). 

e Y value, say Yu, is missing and if it is considered that the asso- 


ciated 
UAE X value, say Xu, should be retained for some special reason, the esti 
missing value is obtained from the formula 


ate с 2 n LY. y Xa- i zi DE, 
SD 2-1) — DE. 


Since the quantity X. 


з (XVI-62) 


| 
| 


1 


§ XVI-10.3.2] With Unequal Numbers in the Subclasses 515 


where E 
(fu + Ya) + ХХ. 
Ez, = YuXa + > Х.У — 2 

Б 


г 


(fu + Y) Xa+ EXS;Y; " (fa + Ү.)Х.. 


rU 


(XVI-63) 


The above formula was obtained by differentiating Ey, with respect to 
fu, setting the result equal to zero, and solving for Yu. In this case, Ey,’ 
is associated with (r — 1)(v — 1) — 2 degrees of freedom, since one value 
was estimated. 

The covariance analysis for more than one missing value or pair of values 
follows that described for estimating values for more than one missing plot. 
A similar method could be developed for the covariance analysis for data in 
which one or more of the yields are mixed up for either the variate or the 
covariate, or both. 


XVI-10.3.2. The randomized complete block design with n;; pairs 
of observations per experimental unit. Covariance analyses for two- 
way classifications (such as the randomized complete block design, or a fac- 
torial arrangement of two factors), with unequal numbers of observations per 
experimental unit or per subclass may be classified as follows: 

Case I—Interaction absent. 

Case II—Interaction present, and the interaction e 
fixed effects. 

Case ITI—Interaction present, and the interaction e 
random variates. 

The linear model for Case I is 

Үе nd rid esc B(Xia — T) + en 
mean effect, the first factor (treat- 
) effect, the average regression of Y 
r component, respectively. The sum 


ffects assumed to be 


ffects assumed to be 


(XVI-64) 


Where д, Ti, ру, B, and eis represent the 
ment) effect, the second factor (replicate 
on X in the error line, and the random erro 
of squares to be minimized is 


EES Yaca ВО D (КУГ) 
i=l jal hel 
Differentiation of (XVI-65) with respect to m, to ri, to pj, and to B yields a set 
of r + v + 2 equations. These equations plus the following equations: 
(XVI-66) 


D г 
yA = 0= des 
i=l ј=1 
ates for effects have been obtained for all three cases, 


"Тһе г i i d estim: 
шип equate er n made (Unpublished results). 


and applications of the results have bee 
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yield the least squares estimates, д, ?;, ду, and й, of the effects и, Ti, pj, and 
В, respectively. Solution of the equations is possible for n;; = 0, 1,2, ---. The 
procedure used here follows that for the variance analysis by the “method of 
fitting constants" as described by Yates [318a], Snedecor [273, sec. 11.12], 
Hazel [152], and Henderson [156]. 


The linear model for Case II is 
Yi, = n rit pj + рту + В(Ху, — ®) + ejw  (XVI-67) 


Where pr;; represents the interaction effect of the two factors and is assumed 
to be a fixed effect and the other effects are defined in equation (X VI-64). 
The sum of squares to be minimized is 


Y. n i 
уз 2 уз [Yay — n — ri — ру рту — 8(Ху, — BP. (ХҮЇ-б8) 
i=l j=l h= 

The resulting ry +r +v + 2 equations and the following equations yield 

the least squares estimates of the effects: 


i5 = Das = Mes = Doers = 0. DEM 
j= E = 


No solution is possible if one or more of the п; = 0. An approximate sontog 


may be obtained by assuming that the interaction effect is zero in any sub- 
class for which ni; = 0; this results in the following assumptions [155]: 


oy ri 
У, È У, ко, (XVI-70) 
© = 


approximation depends upon the number of ni; = Н 
nteraction effects, For Case II the variance analyst 
3 d squares of means" analysis (273, sec. 11.12; 3188]. 
The linear model for Case III is 
Y; 


jr = utr + Pi + бу + Bi(25;. 

where 5;; T - 

class or i 4. Penis a random error component associated with the ijth sub 

or (ema M onis Unit, B; represents the regression of Y on X in the error 

and the ns ES PAIS еш the regression of Y on X within айран 
е : 

estimate of gig “ined in formula (XVI-64). The least squar 


The closeness of the above 
and upon the size of the j 
is known as the “Wweighte 


= £) + B(Xiy — $5) + eim (XVI-71) 


Ё " $0760 nq } 

% 2 >> 2 ЖУУ Ду, — 8752/5 5 S (Хи — yN. (XVI-72) 
el : А 

qu ме 9f и, т, pj, and 8, may be obtained by differenti" 

imposing the restrictions that Бе, 5 T band the results to zero, ап 

Um УН 


Уш. = 


н—т;— pj = Biz. = ER (ХУ1-7) 
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The value of the weights w;; will depend upon the assumptions made. The 
true variance of a subclass mean gi. is тз? + ¢2/ni;. If the subclass means 
are weighted inversely to the variance with which they are estimated, then 
шу = 1/(0 + o2/nij) = ng/(ngos + o2). Now, if сг? is small relative to 
с2/п;у, then ш;; is essentially n;;. If, on the other hand, ғ; is large compared 
to с/т, then w;; is essentially equal to one. However, the true situation is 
usually somewhere between these two limiting situations, and no solution is 
possible unless o and ог? (or o2/cs) are known. An approximate solution is 
possible if the estimates 22 and ¢2 from the experiment are used for the popu- 
lation variances of ез? and сг. This approximation is not too bad if sufficient 
degrees of freedom (say greater than 20 io 30) are available for the within- 
subclasses and for the error sums of squares. Also, if any ni; = 0, the cor- 
responding Wij = nij/ (Nijo? + е2) = 0. 

XVI-10.3.3 n-Way classifications and multiple covariance. Hazel 
[152] illustrates the method for a multiple covariance analysis in a two-way 
classification with unequal numbers in the subclasses. His results, plus those 
in the preceding section, may be utilized in setting up the analytical procedure 
for more complex situations. 


XVI-11 Covariance Versus Stratification 


The discussion in the preceding sections pertains to covariance analyses 


for the various designs. Stratification of the experimental material was used 


to control the variability among the various complete blocks, incomplete 
f experimental material, it is 


blocks, rows, and columns. For certain types o ў 
possible to obtain а measure of the differences, other than treatment dif- 
ferences, among the experimental units. These measurements may then be 
used as the values for the covariate, and stratification may be dispensed 
with. For example, it may be possible to obtain initial weights of animals 
going into a feeding experiment. With these initial weights the animals may 
be grouped into complete blocks'or into incomplete blocks in such a way that 
the weights of the animals within the blocks are nearly equal. On the other 
hand, the animals could be randomly assigned to the various treatments, a 
à covariance analysis of final weights on initial weights could be popes A 
A third possibility would be to stratify the animals into relatively ape 
geneous groups and to obtain initial weights for a covariance analysis 2 oe = 
to Control any residual variation caused by differences in initial weig ts. | е 
Joint use of а covariance analysis and stratification may be indicated ыг 
Certain types of experiments [60, sec. 3.8; 127, sec. 49.1; 247; 273, ved : | 

eterson et al. [247; see example VI-1] give an example illustrating t s 
use of covariance and stratification to control the same variable, size о i с 

һе use of covariance оп leaf size alone would have been more efficient in 


is experiment than stratification either alone or with covariance. 
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Fisher [127, sec. 48], Quenouille [251], and Federer and Schlottfeldt [115] 
discuss the use of covariance to control variability in experiments where a 
gradient exists. А linear or curvilinear gradient may exist in the following 
types of experiments [115]: 


(i) greenhouse experiments where the source of heat is located in the center or 
on the sides of the house, 
(ii) field experiments located in areas containing drainage tiles or ditches, 
(iii) field experiments containing a depression in the center of the replicates, 
(iv) orchard or vineyard experiments on undulating topography, 
(у) animal experiments with animals located at varying distances from the source 
of heat, food, water, etc., 
(vi) experiments in which the yields are affected by slowly migrating insects 
entering the area from one side, 
(vii) and other experiments. 


If the direction and shape of the gradient are known, the covariate may Бе 
assigned to the various experimental units and a covariance analysis computed. 
For example, if the gradient is linear, the sequence of numbers 1, 2,8, 99%, 0 
may be assigned to the various units. These numbers represent the values of 
the covariate. If the gradient is quadratic, one of the covariates consists of 
the above sequence of numbers and the second covariate consists of the squares 
of the numbers. For more complex gradients, other functions of the numbers 


are necessary. 


XVI-12 Expectation of Mean Squares 


There are even more unsolved problems associated with variance com- 
ponent analyses involving a covariate than in the ordinary variance com- 
ponent analyses. Much theoretical work needs to be done before all problems 
are solved. However, enough theory is available for estimating variance 
components from the various adjusted mean squares. 


XVI-12.1 COMPLETELY RANDOMIZED DESIGN 


Variance components in a covariance analysis! have been discussed by 
Cochran [53] and Crump [80] for the completely randomized design. The 
estimate for c? is simply the adjusted mean square for error, Eyy'/ (r-- 
s v — 1). Two estimates of the treatment variance component are available 


тот an analysis such as that given in table XVI-1. One estimate is obtaine 


from the following combination of mean squares: 


Ho, —Gy/G. Е, } (хуї-14) 


0— 2 r.—v—l 


"This is not to be confused wi 


2 Ы со“ 
variance of effects; e.g., E[rip;] = th covariance components [156] which refer to the 


Orp. 


T MEC 
m ———————— c 
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When r; = г for all i, Ё = r, and the expectation of equation (XVI-74) is 
lp; + ro? — с2} = o. x 
VI-75) 


The second estimate of o,? та i 

а 2 may be obtained from a combination of the 
EN squares obtained by dividing G,,/ and E,,’ by the appropriate degrees 
of freedom. The expectation of Gy,’ is equal to 


(0 — De? + ғ А —1— бз Sas | EN ( Ea) 
T? v Ta (v — 1) Cline o те 3 
(XVI-76) 


The estimate of с,? is obtained as 


TONNEN NES 0l E,/ 
$^ = у= + EJTA {б = j. КҮР) 


In А pe е 
"uc instances one estimate has a smaller variance than the second esti- 
e while in other cases the reverse is true [53, 80]. 


XVI- 

VI-12.2 RANDOMIZED COMPLETE BLOCK DESIGN 

en by equation (X VI-13) is the appro- 
k design. Furthermore, in order 
and e; are random independent 
al to (table XVI-3) 


7 рше that the linear model giv 
E Жы for the randomized complete bloc 
т Hn ify the problem, assume that the ту, ру, 

iables. Then, the expectation of yy’ is equ 


ЕЕ] = {УУ = >н? c LY; 4X 


LL (Ky i — Zy +2) = Е 
= rou? + o Бо + od) + P'S 
— (rou? + reo? + vo 2 + voè + BT) 
24 8.) + (ти 
Xy — 8 = ахы i + al 


— (rog? + roo è + гот? + го 24 v0? + res + сё) 
1 
-EE|EXbtstatÉ 
REED О ОСЫ EE, + oè) 
= [(—1)@—1)— П, (XVI-78) 


where S, = Rez + Tes + Ёш 
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The expectation of the treatment (adjusted for error regression) mean 
square is 


атл = BEE уг - 152) -ЕЕТч 213 — нр, 


T; zy)? A 
= Er. - Gatti er) 


Tro; 


= (v — D (c? + re?) + ВТ... — (6 Tog T 8 EJ) 
T (сё + EE) 


2 
=(- {гг аре Yo -1- eu (XVI-79) 


with v — 1 degrees of freedom. 
The estimate of the treatment variance component from the mean squares 
Т,/ and Е, is obtained as follows: 


#2 v—1 dar = E, \. Х 
dé ати Ег С 


Likewise, an estimate of c,? may be obtained from the mean square for 
treatments adjusted for the treatment regression; thus: 


= 2 f 
p= Та s QE ova 


The expectation of the other sums of squares in the analysis of covariance 
and of sums of squares with more than one sample or individual per experi- 
mental unit may be obtained in a similar manner [152, 156]. 


XVI-12.3 DOUBLE LATTICE DESIGN 


The linear model for one set of the double lattice design with a covariate is 


Үш = u + pit Bu + riy + eg + Bilas. — 2) + Bo Xii; — н. 


Уну = и + pa + Boj + туу + еу + Balte — 2) + BOX — ATED 


s the plock 


where the-effects are defined in sections XI-8 and XVI, В: i т 
block er 


(eliminating treatment) regression coefficient, and 8з is the intra 
regression coefficient. In the above set of equations, it should be note ^ 
the 8; and 6; are not regression coefficients in the same sense that WE 
are. The expected value of 812 = 6,2, and of 82 = 62, but the expecte 

of Bi? and of Bo; = op’. 


— mun 
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If 8; = b: = В, the above equations reduce to 


Yu = w+ md В: + rg + ез + 80335 — 2) 
. (XVI-83) 


Yay = p + po + Bos + ту + en; + В(Ху — 2) 
In obtaining the expectations for the various sums of products given in 


table XVI-12, it is assumed that formula (XVI-83) is the appropriate linear 
model for a double lattice design with a covariate. The expectation of By, is 


Е[В,„ = tel Dan. = ğu- — ўз.. + 91..)° 


T 2203; = 925 — hee + 2) 
= 2(k — le? k(k — Des + ВВ. (XVE84) 
The expectation of B,,* is equal to 


rfs, = эы} = E[B] — ag E [Ys — Yu — Bu. — fae e) 


+ XQ; – You) Gis — S25 — &- + a.) 
J 


= 2(k — De? ERE — Djeg + PBa — fog — s — BB. 
= (2k — 3)(o2 + Вс /2), (XVI-85) 


with 2k — 3 degrees of freedom. 
The expected value of the block (eliminating treatment) sum of squares 


adjusted for intrablock regression, By,’, is equal to 
r E. 2 E. 
к{в„ - Gat Bel + шг} = ив) 


= a Lg E XD = д... — X4; 2) 


1 у " 3 
+ (Ха; — ža.. — Xa DI} + УУ Ү. Аш = Zi. — 2.5 + 2 
— Қ. — Zu. — Be HH.) + з Ү.ДХәу — #.. — 25 + 2 
v HAGE "* 22.; = qi + o..)] : 
= 2(k — 1)е2 + k(k — Dop + 828. 


3 khi cn т E.) + (od + PEG) 
(s: "mes prd ; 


? B-z i 
"opere дк -1- x5 25) quur 
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with 2(k — 1) degrees of freedom. 

The expectation of the intrablock error sum of squares adjusted for co- 
variation in X is 


E[E,,] = E(E,, E Е.2/Е.\ 

= ESED Y; = Je.— 9.5 +9 Alz — ди. — Ga + й--)]® 

+ Ys = a — Gp $9 – Юу — T — Йй. + a.) 

= Fig. = (k — 1)'e? + BEz — (c? + BE) = k(k — 2)o2, (XVI-87) 


with k(k — 2) degrees of freedom. 
From equations (XVI-86) and (XVI-87) the estimated amount of inter- 
block information is equal to 


1 — B./2(k — 1)(В + Е..) 


w = ТБ 7/(Е — 1] ТЕЕ — ТУ] + Be /20 — Ba + ЕЁ] 
(XVI-88) 


and the estimate of interblock information is 
Dl k(k — 2) 
Ey,’ З 


Formula (XVI-88) results in a different value for w' than that obtained from 
formula (XVI-35). However, the difference will not be large if k is greater 
than 4 or 5 and if B../2(k — 1) is not radically different from E../(k — 1)’. 
As an approximation, it would appear that formula (XVI-35) is suitable for 
estimating w’ using В,, and E,,'. If B,,* and Е, are used to obtain the 
estimate w’, then formula (XVI-34) gives the correct value. 

If the two regression coefficients, 81 and 6, in equation (XVI-82) are not 
equal, then the expectations of the mean squares may be obtained in a manner 
similar to that described above, and the weights may be estimated accordingly- 
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resolvable, 415, 416, 423 

size and shape, 313, 314 

tests of significance for treatments, 419 
Incomplete latin square design, 429-433 
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advantages and disadvantages, 136 
analysis, 144-158, 281, 289 
arrangements, 137-140, 244, 281, 289 
construction, 137-140 
derivation of formulae for missing values, 
163-165 
efficiency, 136, 147-150, 152 
example, 145, 150, 153, 490 
layout, 135, 142-144, 281, 289 
missing rows or columns, 429-434 
missing values, 157, 158, 163-165 
randomization, 140-142, 150 
with additional rows or columns, 429, 434 
Lattice design 
adjustment of treatment mean or total, 
324, 326, 332, 335, 339, 342, 348, 355, 
356, 359, 383, 387, 391, 393, 401, 404, 


Generalized error variance, 89 
Graeco-latin cube, 471 
Graeco-latin square, 464-468 


Half-plaid latin square design, 475-477 
Hartman, J. D., 422 

Henderson, C. R., 220 

Hierarchical classification, 96, 97, 106-109 
Homeyer, Р. G., 297 

hsd test, 29, 30, 32, 205, 448 
Typer-graeco-latin cube, 470-472 
Tlyper-graeco-latin hypercube, 470 


Hyper-graeco-latin square, 468-470 
Hypothesis, 2 

Hypothesis, alternative, 2 
Hypothesis, null, 2 


Incomplete block, 229 e! seq. 
Incomplete block design; also see Lattice 
designs 
adjustment of treatment mean or total, 
232, 238, 244, 246, 250-253, 267, 417, 
418, 421, 426, 428 
analysis, 229 ef seq. 
arrangements, 229 ef seq. 
efficiency, 233, 234, 238, 239, 245, 261, 
262, 268, 416, 422 
example, 236, 247, 255, 274, 419, 424 


405, 417, 418 

advantages and disadvantages, 308, 311 

analysis, 263-268, 311-313, 319 et seq. 

classification, 314-318 

construction, 307, 318, 319, 333, 336, 341, 
343, 344, 351, 364-367, 381, 385, 393, 
424, 431 

degrees of freedom for effective error 
variance, 369-371 

efficiency, 308, 320, 324, 335, 340, 342, 
346, 355, 378, 384, 387, 391, 393, 405, 
416, 422 

example, 321, 332, 337, 341, 346, 349, 381, 
385, 395, 407, 419, 424 

five-dimensional, 366-368 

four-dimensional, 364, 365, 413 
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four-restrictional, 413 
general error variance of a difference, 313 
layout, 311 el seq. 
missing data, 368, 369, 413, 414, 419 
n-dimensional, 316, 318, 364-368 
one-restrictional, 318-376, 378, 415-429 
randomization, 311, 320, 336, 349, 379, 
415 
requirements for balance, 307, 340 
s-restrictional, 316, 318 
tests of significance, 369-371, 414, 419 
three-dimensional, 315-317, 348-364, 411, 
412 
three-dimensional with more than three 
replicates, 363, 364 
three-restrictional, 412 
two-dimensional, 315, 318-348, 378-410 
two-restrictional, 378, 411 
variance of a difference, 313, 323, 329, 
362, 363 
Lattice rectangle design, 388, 389 
Lattice square design, 
adjustment of mean or total, 383, 387, 
391, 393, 401, 404, 405 
analysis, 380-410 
arrangement, 379, 380, 384, 389, 391, 
393-395 
efficiency, 378, 384, 387, 391, 393, 405 
example, 381, 385, 395, 407 
layout, 378, 379, 381, 385, 389, 407 
missing data, 413, 414 
randomization, 379 
tests of significance, 414 
with less than (k + 1)/2 replicates, 
389-391 
with more than (k + 1)/2 replicates, 
391-393 
Least significant difference, 91, 238 
Least significant range, 27, 28, 33, 36 
Least significant sum of squares, 33, 36 
Least squares estimate 
of effects for a p X q factorial, 215, 216 
of effects in a completely randomized de- 
sign, 101, 102, 104, 105 
of effects in a double lattice design, 
371-374 
of effects in a latin square design, 159, 161 
of effects in a randomized complete block 
design, 129, 132 
of effects for all possible crosses, 223 
of intercept in regression, 110-112 
of linear regression coefficient, 110-112 
of sample mean, 98, 99 
properties, 98 
Linear model 
an observation, 99 
change-over design, 448 
change-over design with a covariate, 507 
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completely randomized design, 101, 106, 
107 
completely randomized design with a 
covariate, 485 
double lattice design, 371 
double lattice design with a covariate, 
520, 521 
experiment on a perennial crop, 301 
latin square design, 158, 160, 162, 163 
latin square design with a covariate, 489 
p X q factorial in а randomized complete 
block design, 215, 220, 223 
р X q X kfactorial in a randomized com- 
plete block design, 218 
randomized complete block design, 127, 
131 
randomized complete block design at sev- 
eral locations, 299 
randomized complete block design with a 
covariate, 487, 488, 515, 516 
split block design with a covariate, 497 
split plot design, 298 
split plot design with а covariate, 495 
Linked block design, 437 
Long term experiment 
design and analysis, 456-463 
example, 459 
types, 457 
Isd test, 20, 21, 29, 32 


Magic latin square design, 478-480 

Maximum curvature, 62, 63 

McCarthy, P. J., 142 

Method of fitting constants, 126 

Methods for controlling experimental varia- 
tion, 483 

Minimum sum of squares among means 
when extremes are retained, 37-39 

Morris, W. H. M., 241 

msd test, 21 

Multiple F tests, 32-42 

Multiple range tests, 20-32 

Multiple é test; see lsd test 


Nonparametric tests, 51 

n-treatment type I error; see p-treatment 
type I error 

Null hypothesis, 2 


Optimum multifactorial experiment, 269 
Orthogonal latin square, 343, 466-468 


Partial confounding 
adjustment of treatment mean 
246, 250-253, 267 
analysis, 244-270 
example, 247, 255 
general method of analysis, 263-265 


or total, 


* 


Index 543 


recovery of interblock information, 
266-268 
3” series, 251-253 
2^ series, 244-251, 263-270 
Partially balanced incomplete block design 
(p.b.i.b.), 308, 313, 318, 415, 423-429 
Partially balanced lattice square design, 388 
Pen technique, 58 
Permanent effect, 456 
Plaid latin square design, 477, 478 
Plot technique, 58 
Proportional factorial, 212, 213 
Pseudo-effect, 318 el seq. 
Pseudo-factorial, 318 


Quadruple lattice design, 318, 348 
Quasi-effect, 318 

Quasi-factorial, 307, 308, 318 
Quasi-latin square, 378 
Quasi-latin square design, 472-478 


Randomization, relation to experimental 
error, 1 
Randomized complete block design 
advantages and disadvantages, 114, 115 
analysis, 115-127 
confounding of effects, 242 
derivation of formulae for missing values, 
133, 134 
efficiency, 116, 119, 124 
example, 117, 120, 176, 191, 203 
layout, 114, 115, 273, 288, 289 
missing value, 125-127, 179, 180 
missing values in a covariance analysis, 
513-515 
unequal numbers of observations, 124-127, 
513-517 
Rectangular lattice design 
near balance, 345 
simple, 345 
triple, 345-348 
Replicate 
shape, 70, 71 
size, 71 
Replicates, 
Harris-Horvitz-Mood method for deter- 
mining number of, 73-77 
Tang’s method for determining number 
of, 76, 78, 79 
Tukey’s method for determining number 
of, 78, 80 
Replication, 70-80 
Replication, relation to experimental error, 1 
Residual effect, 438, 444, 447, 451-453, 457 
Reversal design, 438 
Richards, M. L., 454 
Robson, D. S., 50, 206-209 
Rotation experiment, 438, 459-463 


Rotations, comparison of, 462, 463 


Sade, A., 140 
Samples, relationship between size of experi- 
ment and number of, 84 
Sampling 
determination of optimal rate of, 82-85 
loss in information due to, 81, 82 
Sampling error, 120, 123 
Sampling unit 
definition, 58 
optimum number, 67 
Scheffe’s multiple F test, 40-42 ә 
Scientific experimentation, principles of, 2-6 
Selection of n largest from v means, 44 
Selection of treatments, 166 
Self-adjugate square, 137 
Self-conjugate square, 137, 138, 140 
Semi-balanced lattice square design, 371-384 
Semi-latin square design, 429-431 
Short cut to allowances, 30-32 
Shrikhande, S. S., 309 
Significance level for /, an approximation, 
94, 95 
Simple lattice; see Double lattice 
Smith, H. F., 496 
Split block design 
advantages and disadvantages, 288 
analysis, 288-291 
efficiency, 290, 291 
layout, 288-290 
missing data, 291 
randomization, 288 
Split plot design 
advantages and disadvantages, 272, 273 
analysis, 273-288, 291-306 k 
efficiency, 274, 279, 280 
example, 274, 284 
for non-factorial experiments, 434-437 
layout, 273, 275, 281, 292, 296 
missing data, 287, 288 
randomization, 273 
Split plots in a latin square, 280, 281 
Split split plot design, 291-295 
Standard deviation, 91 
Standard error of a difference 
balanced lattice design, 343 
change-over design, 449, 452 
completely randomized design, 91 
cubic lattice design, 354, 355, 375, 376 
derivation for cubic lattice design, 375, 
376 
derivation for double lattice design, 375 
double lattice design, 323, 324, 335, 375 
five-dimensional lattice, 368 
four-dimensional lattice, 365 
incomplete block design, 261, 422 
latin square design, 145, 147, 152, 156 
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lattice square design, 384, 387, 391, 393, 
406, 410 
randomized complete block design, 119 
rectangular lattice design, 345, 316 
simple change-over design, 412 
split block design, 289-291 
split plot design, 279, 287, 292, 293, 435, 
436 
triple lattice design, 340 
with covariance, 493 
Standard error of a single determination; see 
Standard deviation 
Standard error of a treatment mean, 91, 93, 
193, 203, 250-253 
Standard error of an effect, 179, 183, 186, 
188, 201, 203, 205, 206, 250 
Standard square, 137, 139-141 
Statistics in relation to scientific experi- 
mentation, 6 
Strip trial design, 288 
Student-Newman-Keuls multiple range test, 
21-26, 29, 32 
Sub-plot; see Split plot 
Subunits in strips, 288 
Super magic latin square design, 480-482 
Switch-back design, 438 
Switch-over design, 438 
Symbolism, 16, 17 
Systematic design 
advantages and disadvantages, 9 
balanced, 15 
Beavan's chessboard, 8, 430 
Beavan's half-drill strip, 9 
diagonal square, 8 
Knight's move, 8 
Knut Vik square, 8, 469 


Test for additivity, 49-51, 206-209 
Test of significance 

definition, 3 

power, 3 

probability levels, 51-57 
Tied latin square design, 440 


Transformation of data 
arcsine or angular, 47, 177, 185 
logarithmic, 47, 48 
square root, 46, 47, 95 
Treatment, definition of, 1 
Treatment design, 168 
Treatments applied in sequence, 438-463 
Triple lattice design, 318, 336-340, 349 
Tukey, J. W., 31 
Tukey’s gap, straggler, and variance test, 
42-44, 123 
Tukey’s test based on allowances; see hsd 
test 
Two-way whole plot design, 288 


Unbalanced partial confounding, 230, 246, 
267 


Variability of experimental material, 1 
Variability of experimental material, meth- 
ods of reducing, 12, 13, 483 
Variance 
of a difference with covariance, 486 
of a mean, 99, 100, 417, 418, 428, 429 
of an effect, 180 
of effects in a p X q factorial, 216, 217 
of effects in a randomized complete block 
design, 129 
of linear regression estimates, 112, 113 
Variances, tests of homogeneity for, 48, 49, 
89, 90, 93, 185, 208 
v-treatment type I error, 20, 21, 29, 32 
Vittum, M. T., 477, 478 
Weighing design, 482 
Weighted squares of means, 126 
Whole plot, 271 ef seq. 
Whole plots in a latin square design, 280, 281 


Yates, F., 142 

Youden linked block design, 437 

Youden square design, 429, 431-433 

Youden square design, partially balanced, 
434 
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PROBLEMS 


CHAPTER IV - THE COMPLETELY RANDOMIZED DESIGN 

PROBLEMS 

1. Fifty-four plants are selected at random from the area planted to variety 
109. These аге the same plants on which dry weight of shrub was obtained in 
table IV-2. The character rubber percentage was Obtained on the individual 


plants. The data are: 


Plant No. Normals Offtypes Aberrants 
1 6.97 6.21 4.28 
2 7.11 5.70 7-71 
3 7.26 6.04 6.48 
n 6.80 4.47 7.71 
5 7.01 5.22 ТЭТ 

, 6 7.00 5.55 7-20 
T 6.55 i, ks 7-06 
8 6.37 4.84 6.40 
9 7.29 5.88 8.95 

10 7.51 5.82 5.91 

1 6.86 6.09 5.51 

i2 6.81 5.59 6.56 
| 15 6.45 6.06 

14 7-43 5.59 

15 6.68 6.74 

16 1.29 

iT 7.12 

18 6.68 

19 1.3% 

20 5-15 

21 6.41 

22 6.45 

23 6.32 

24 6.82 

25 6.86 

26 6.48 

27 7-28 

БҮ, 185.88 85.25 ^ 80.92 

zx 1258.56 478.98 561.64 

РАЯ 210h6.5h 8267.75 2099.80 


(1) Test the mean differences of normals anà offtypes and of offtypes and 


aberrants by t-test. 

(ii) Are the variances homogeneous? 

(iii) Run a covariance analysis of rubber percentage 
shrub. 

| 1 


(x) on dry weight (X) of 


2 THE COMPLETELY RANDOMIZED DESIGN 

(iv) Does the regression of the means differ from the average within-type of 
plant regression? 

(v) Do the within-type regressions differ from the average within regres- 
sion? 

(vi) Is the relative variation among dry weights of shrubs significantly 
greater than that among rubber percentages for aberrants? 


2. The 54 plants of variety 109 were analyzed for resin percentages. The data 
follow: 


апе em Nome M OC 6 eypes Е Е Aberrente: 

x 5-71 6.17 3.97 
2 6.15 6.04 6.65 
5 6.05 5.89 5.4} 
4 5.6} 5.91 7.20 
5 5.85 5.22 6.52 
6 5.62 5.75 6.51 
T 5.60 5.58 5.92 
8 5.00 5.99 6.81 
9 6.06 5. T.34 
10 6.05 5.88 5.55 
11 5.2% 6.15 5.22 
12 5.66 5.85 5.95 
15 5-55 5.88 

ИЛ 6.25 6.5% 

15 6.06 5.85 

16 6.10 

17 6.07 

18 7-15 

19 6.55 

20 5.85 

21 5.85 

22 5.67 

25 6.01 

24 5.64 

25 5.88 

26 5.63 

27 6.35 

D^ 157.16 87.68 75.08 

m 905.21 513.80 455.06 


(4) Do the types differ with regard to resin percentages at the end of one 
year's growth? 

(11) Compute а least or minimum significant difference. Has it any meaning 
for these data? 

(111) Compute the coefficient of variation. Do you believe that the varia- 


tion among plants was so great as to obscure differences among the 
types for resin percentages? 


5. For the sample X, =, X, «2, and X, = 5, graph the sum of squares for ^" 


ious values of ie Find the least pee estimate of u and its variance. 


LITERATURE CITATIONS 5 
lh. Given the following breakdown of the total degrees of freedom: 


Source of variation af 


Among regions 2 
Among types within regions 5 
Among individuals within types 25 


Find the expectation of mean squares, estimate the variance components, and ob- 
tain an analysis of covariance for the data presented by Day and Fisher, Ann. 


Eugenics 7:555, 1957. 


5. By the method of section IV-2.5, obtain the least squares estimate of и and 
В in the model Y, =H + e, - X) + €,. Obtain the variances and the covari- 


ance of y and b. 


6. H. 0. Dunn (Cornell Univ., unpublished data) obtained the body weights of 
calves at 8 weeks of age. There were 5 levels of feeding given to а random 
sample of five calves each. The completely randomized design was used. Obtain 


the analysis of variance, compute F, and obtain the standard error of а feeding 


treatment mean for the data on body weights: 
Level of feeding 


Subnormal Normal Supernormal 
n8 ile 162 
122 129 175 
121 134 168 
126 132 185 
109 155 172 
Total 596 672 858 


LITERATURE CITATIONS 


Allan, F. E., Australian Inst. Agr. Sci. J. 1:109, 1936. 
(3 varieties with 6 greenhouse pots each) 
Bartlett, M. S., J. Agr. Sci. 25:238, 1955. 
4 breeds of dairy cows with 4, 4, 4, and 8 cows per breed) 
Bliss, C. I., Ann. Math. Stat. 17:222, 1946. 
(Standara and new drugs) 
Day, B., and Fisher, В. A., Ann. Eugenics 7:333, 1957- 
(5 regions with Gare within regions and individuals per type, unequal numbers) 
Dudley, F. J., and Read, D. R- Harper Adams Utility Poultry J. 34:65, 1949. 
(5 examples on feeding poultry) 


on, D., Emp. J- Exp. Agr- 1:17, 1955. 


Fraser, A. H. H., and Roberts 
per treatment) 


(2 feeding treatments with 16 lembs 
Hendricks, W. A., J. Agr. Sci. 25:258, 1955. 

(2 diets of 7 white rats each; 6 lots of chickens with 8 groups) 
—— , Poultry Sci. 14:365, 1925- 

(4 lots of chickens) 

Mahalanobis, P. C., Indian J. Agr. Sci. 2:131 1955. 
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(5 treatments with 20 samples per treatment on 6 characteristics of cotton 
plants) 


Merrill, T. A., J. Agr. Res. 69:9, 190%. 
(2h soils and fertilizer treatments with 4 pots each) 


Norton, C. L., etal., J. Dairy Sci. 29:251, 1946. 
(Examples of 2, 2, and 5 feeding treatments with 8, 12, and 5 calves each, re- 
spectively) 


Salisbury, б. W., J. Dairy Sci. 27:551, 191}. 
(2 treatments on several characteristics of bull semen with 10 bulls on each 
treatment) 


Wilcoxon, F., Biometrics 1:80, 1945. 
(2 insecticides on houseflies with 8 trials each) 


Wishart, J., and Sanders, Н. G., Emp. Cotton Growing Corp., 1936. 
(5 pseudo-treatments on uniformity data in 10 plots each) 


Woodman, Н. E., etal., J. Agr. Sci. 26:546, 1956. 
(3 group feeding treatments with 10 pigs in each) 


Other examples may be found in various textbooks, e.g., Snedecor, Statistical 
Methods (ch. 10; pp. 318, 341); Goulden, Methods of Statistical Analysis (ex. 
29); Leonarà and Clark, Field Plot Technique (ch. 12, table 1); Cochran and 
Cox, Experimental Designs (рр. 86-95); Fisher, Statistical Methods for Research 


Statistical Metnoos ior muammar. 


Workers (ex. 41, 44); Paterson, Statistical Techniques in Agricultural Research 
(p. 48); Snedecor, Calculation and Interpretation of Analysis of Variance and 
Covariance (ex. 1, 11); and Kendall, The Advanced Theory of Statistics, vol. II 
(pp. 175-181). 


if 
\ 


CHAPTER V - RANDOMIZED COMPLETE BLOCK DESIGN 

PROBLEMS 

1. Bartlett (J. Roy. Stat. Soc., Suppl. 5:185, 1936) presents the following 
data (counts) obtained from an experiment on control of leatherjackets by cer- 


tain toxic emulsions: 


1(control) 


Replicates 2(control) 6 
1 ge ' 66 19; S229) 16. 9 29 
2 60 46 ЗО 5 
3 46 81 Ah ven 16 9 
if 120 59 15: 1210 2 
5 49 64 25 24 8 T 
6 15% 60 Бо 220) E280 MRI 


The design was а randomized complete block. Complete the analysis of variance. 
Are the variances homogeneous and is a transformation necessary? Using the 


method of section тІ-1.1. set 95 percent confidence intervals on the differ- 


ences. 


2. Partition the treatment sum of squares in example 1 into the four orthogo- 
nal comparisons each with one degree of freedom. Two of the components are 
linear апа quadratic. What are the other two? The linear component sum of 
squares is 3492.29, and the additional sum of squares due to quadratic regres- 
sion is 157.05. Verify the latter figure and obtain the sum of squares for the 


other two contrasts each with one degree of freedom. 


3. Delete the yields 9.9 and 36.6 in replic&te VI of table V-1. Compute the 
values for the missing data, and complete the analysis of variance with the 
adjusted sum of squares for treatments. Compute the standard error for а 


treatment mean. 
4, Delete the following values in table V-5: 1.06 and 3.75 in plot 7, 2.06 in 
plot 1, 5.12 in plot 1h, anà 5.00 in plot 29. Complete the analysis of vari- 


ance table and compute the standard errors for the various means. 


5. The following number of celery plants for various treatments were available 


for an experiment: 
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Untreated or 
Sowing dates Cold treated controls 
т 50 10 
II 60 2h 
III 96 48 
IV 100 95 


It is desired to obtain information on all 8 treatments and to use a random- 
ized block design. The plants are to be planted in rows 3 feet apart with T 
inches between plants within the row. The data to be taken are number of days 
from planting to flowering. Since celery is a biennial plant it is suspected 
that the controls (untreated plants) will not flower. How would you design 
this experiment and why? What would be your second choice of a design? 


6. An experimenter had the following amount of material available: 


5 clones lO plants each 

5 8, lines 80 " " 17 

5 FA's Bou " different 
T F's LOM at 2 items 
ndn an Р 


It was desired to make use of all the material if possible and to use 8 plant 
plots. Also, the experimenter wished to compare these items for yield and 


other characteristics under field conditions. What design would you use? How ` 


many replicates of each entry would be available? What is your second choice 
of a design? Why? Give the breakdown of the degrees of freedom for each de- 
sign. 


T. For an experiment with one-year-old cherry trees, 8 storage treatments are 
to be compared. Eighty trees are available for each treatment. The 640 trees 


аге to be planted one foot apart in а 20' х 32" area. Measurements арӣ obser- 


vations to be made are number surviving, the length of shoot growth the first 
year, and the diameter increase at the end of the first and second years. The 
trees will probably be removed after the second year. What experimental design 
would you use? Give the complete experimental plan including the randomization 


and plot numbering; give the reasons for your selection of the design and the 
plot size. 


LITERATURE CITATIONS 


Autrey, K. M., et al., J. Dairy Sci. 30:775, 1947. 
(21 silage mixtures in miniature silos, 10 ты) 
Bartlett, M. S., J. Roy. Stat. Soc., Suppl. 3:68, 1956. 


(6 weed control treatments on oats, h repl of 
i ; umber 
wheat seeds germinating, 4 replicates) Zee enr Т пукшен 0o € 


LITERATURE CITATIONS 1 


, J. Roy. Stat. Soc., Suppl. 4:137, 1951. 

(2 experiments on dairy cows - l treatments in 4 replicates, missing data; Т. 
treatments, germination data on peas, 6 replicates; T treatments, insecticides 
on blowflies, 5 replicates) 

, and Kendall, D. G., J. Roy. Stat. Soc., Suppl. 8:128, 1946. 

(3 groups ага 15 days) 

Beall, G., Biometrika 32:243, 190. 

(7 entomological experiments with 3-7 treatments and 6-10 replicates) 

Becker, E. R., et а1.‚ Iowa State College J. Sci. 11:211, 1937; 12:911, 1951; 


13:243, 1959. 
(2 treatments on pairs of animals) 


Bigot, A., Landbouwk. Tijdschr. Wageningen 46:640, 195h. 
Bliss, C. I., J. Am. Stat. Assoc. 35:598, 190. 

(6 treatments on the 6 rats of a litter with 6 litters) 
Bose, R. D., Agr. and Livestock in India 2:605, 1952. 

(4 barley varieties in ! replicates) 

Bose, S. S., and Mahalanobis, P. C., Sankhya 0:105, 1958. 
(5 treatments in 4 replicates) 

Brandt, A. E., J. Am. Stat. Assoc. 36:285, 19h1. 

(2 corn varieties in 5 years) 

Braude, R., and Foote, A. S., J. Agr. Sci. 32:324, 1942. 
(5 feeding treatments on 8 three-pig litters) 


Carruth, L. A., J. Econ. Ent. 29:205, 1956. 

(11 chemical and mechanical treatments for controlling corn ear warm lervae in 
6 replicates) 

Chowdhury, S-, Indien J. Agr. Sci. 14:391, 1944. 

(Effect of 5 manurial treatments on root rot of pan in 6 replicates) 


Christidis, B. G-, J- Agr. Sci. 25:231, 1955. 
(9 varieties of cotton each planted twice in a replicate; 5 replicates) 


Cochran, W. G., Emp. J. Exp. Agr. 6:157, 1938. 
pa 85 mangolds from 8 treatments in 4 du Ded 3 à groups of words 
treatments), reaction time from ) 


, Biometrics 3:22, 1947- 

(4 times of applying nitrogen to wheat in ! replicates) 
Am. Soc. Animal Prod., Proc. 25:56, 1950. 
(2 feeding treatments on 10 pairs of animals) 


Sci. Agr. 11:281 347; 12:174, 1951- 
(2 feeding Mies dias САБ pairs of pigs; 2 feeding treatments on 9 pairs of 
cows; 2 feeding treatments on 10 pairs of pigs) 

Dabral, B. M., and Chiney, S- 5:› Indian J. Agr. Sci. 8:629, 1938. 

(6 fertilizer treatments on cotton in 6 replicates) 


. 15:4 5 
Das, S., Indian J. Agr. Sci 15:47, 1945 ме ааны) 


(5 depths of placing superphosphate on mus 
Adams oe, Poultry J. 34:65, 1949. 


Dudley, F. J., and Read, D. R-» Harper 
(Mi T) , blocks = pens anā feeding rations 
Ensign, R. D., et а1., Mimeo. report, Colo. A and M, Fort Collins, ighT. 


(Several examples on pinto bean yields 
. Bur. Soil Sci-, Tech. Comm. 10, 1930. 


Fisher, R. A., and Wishart, J-> Imp 
(5 fertilizer treatments on barley in 6 replicates) 


Goulden, C. H., Sci. Ag- 11:681, 1951- 


Crampton, E. W., 
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(9 fertilizers on potatoes in ! replicates) 


Heath, О. V. S., Emp. J. Exp. Agr. 2:205, 195%. 
(5 treatments of soil compaction in ! replicates) 


Hoblyn, T. N., Imp. Bur. Fruit Prod., Tech. Comm. 2, 1951. 
(8 black currant varieties in 6 replicates) 


Kalemkar, Е. J., and Dhannalal, L. A., Sankhya 4:567, 19%0. 
(5 varieties of cotton in 4 replicates) 


Koshal, R. S., et al., Indian J. Agr. Sci. 10:975, 1940. 
(6 varieties of cotton in 10 replicates) 


Krishna Iyer, P. V., Indien J. Agr. Sci. 10:686, 19h0. 
(9 varieties in 6 replicates with 17 missing values) 


Таде11‚ W. R. S., and Cochran, W. G., Ann. Appl. Biol. 25:341, 1938. 
(5 treatments on wireworm in 10 replicates; 6 treatments on wireworm in 8 
replicates) 


Loosli, J. K., et al., Cornell Univ. Agr. Exp. Sta. Mem. 265, 19}. 
(2 feeding treatments on 9 pairs, 10 pairs, ага on б pairs of animals) 


Lynes, F. F., J. Am. Soc. Agron. 21:980, 1935. 
(15 treatments on bindweed eradication in 5 replicates) 


Ma, P. C., J. Agr. Assoc. China 158:15, 1937. 


Mahalanobis, P. C., and Bose, S. S., Indian J. Agr. Sci. 3:529, 544, 1955. 
(5 cotton varieties in 5 replicates; 5 wheat varieties in 8 replicates) 


› and Ray, S., Sankhya 2:597, 1956. 
(5 classes of students and 5 schools on school marks and I.Q.'s) 


Martins, В. G., Bolivia Ministry Agr. 34:1, 1945. 
Ming, W. C., Proc. Crop Sci. Soc. Japan 9:97, 1937. 


Mitra, M., Indian J. Agr. Sci. 7:459, 1937. 
(9 seed treatments on wheat and 7 treatments each in 6 replicates) 


Paterson, D. D., Trop. Agr. 10:267, 303, 346, 1955. 
(Several examples, some ere hypothetical) 


Pearce, S. C., J. Roy. Stat. Soc., Suppl. 10:252, 1948. 
(Numerical examples of interchanged and substituted plots) 


Pope, 0. A., Ark. Agr. Exp. Sta. Bul. 519, 1955. 
(5 cotton varieties in 10 replicates) 


Pruthi, Н. 5., and Narayanan, E. S., Indian J. Agr. Sci. 9:15, 1959. 
(Several characters on 5 sugar cane varieties in 8 replicates) 


Robertson, H. F., Indian J. Agr. Sci. 2:55, 1952. 
(5 fertilizers on rice in 6 replicates) 
Rothamsted Exp. Sta. Reports 1925-1936. 


(Numerous examples and analyses of randomized complete block designs оп а var- 
iety of experiments) 


Snedecor, б. W., Biometrics 4:132, 1918. 
(4 preservative treatments on alfalfa silage in 5 periods) 


Southern Coop. Series Bul. 10, N. С. Okla., etc. AS 

(Several examples) Á = ? „1% 

Telegdy Kovats, L. v., Mezdgazd4s. Kutatas. 10:1, 1937. 

Thomas, W. D., et al., Colo. Agr. Exp. Sta. Misc. Series 435, 1949. 

(Several examples on yields of pinto bean varieties; 5 fertilizer treatments OP 
pinto beans in 5 replicates) 


Thornton, Н. G., J. Agr. Sci. 19:373, 1929, 


——— 
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(T seeà inoculation treatments on runner beans in 6 replicates) 


Titus, Н. W., and Harshaw, Н. M., Poultry Sci. 14:3, 1955. 

(4 diets on Rhode Island cockerels in 15 replicates) 

Turner, P. E., Trop. Agr. 9:120, 206, 1932. 

(Tillage and manurial treatments on sugar cane) 

Vaidyanathan, M., Indian J. Agr. Sci. 43307, 1934. 

(4 treatments on tea in 6 replicates) 

Watson, S. J., and Ferguson, М. S., J. Agr. Sci. 26:337, 1956. 
(4 feeding treatments on 5 sets of 4 cows each; also a missing value) 
Wilcoxon, F., Am. Cyanamid Co., Stamford, Conn., 1946. 

(5 seed treatments in 6 replicates) 

Wishart, J., Biometrika 30:16, 1938. 

(5 protein diets on pigs in 5 replicates) 

, Imp. Bur. Plant Br. and Genetics, 1940. 

(5 barley varieties in 4 replicates) 

Wood, В. C., Trop. Авг. 9:290, 1932. 

(8 varieties of sugar cane in ! replicates) 

, Emp. J. Exp. Agr. 4:365, 1956. 

(5 manurial treatments on bananas in 4 replicates) 

Woodman, Н. E., and Evans, R. E., J. Agr. Sci. 32:85, 1942; 35:1, 155, 1943; 
35:44, 155, 1945. 

(Examples of feeding treatments on pigs) 

Woodman, В. M., and Johnson, D. A., J. Agr. Sci. 36:87, 1946. 
(4 soils in 9 replicates) 


Yates, F., Emp. J. Exp. Agr. 1:129, 1955. 
(Examples with several missing plots) 


In addition to the above examples others may be found in various textbooks; 
e.g., Baten, Mathematical Statistics (р. 255); Cochran and Cox, Experimental 


Designs (pp. 40, 96); Fisher, Design of Experiments (pp. 66, 141); Goulden, 
Methods of Statistical Analysis (ch. XI); Hayes and Immer, Methods of Plant 


Breeding (pp. 309, 326); Immer, Applied Stdtistics (pp. 3, 11, 16, 33, 62); 
Johnson, Statistical Methods of Research (p. 289); Kendall, Advanced Theory of 


Statistics, vol. II (pp. 18h, 230); Leonard and Clark, Field Plot Technique 
(рр. 172, 179, 18h, 185); Love, Experimental Methods in Agricultural Research 
(pp. 21, 37, 59, 69, 75, 115); Mather, Statistical Analysis in Biology (pp. 56, 
130); Paterson, Statistical Techniques in Agricultural Research (pp. 53, 59, 
163, 190, 206, 216, 221, 226); Snedecor, Calculation and Interpretation of Anal- 


ysis of Variance and Covariance (ex. h, 6); Snedecor, Statistical Methods (pp. 
ysis of Variance and Covariance t: 


25%, 265, 270, 280). 


СНАРТЕВ VI - THE LATIN SQUARE DESIGN 
PROBLEMS 


1. Compute the standard error of a treatment total and the corresponding isd 
and hsd for the data of table ҮЇ-1. 


2. Find the efficiency of the latin square design in table 11.11 of Snedecor's 
book relative to a completely randomized design and to the two randomized сош- 
plete block designs obtained when the columns are used as replicates and when 
the rows are used as replicates. Discuss briefly the effect of reducing the 
block size from 25 plots to 5 and the 2-way elimination of variation. Obtain 


the efficiencies adjusted for the difference in degrees of freedom in the two 
error variances. 


3. In example VI-2 an experiment on the marketing of McIntosh apples was dis- 
cussed. The entire experiment consisted of four lx! latin squares. The data 


from two of the latin squares are given in table VI-2. Тһе data from the other 
two latin squares are given below: 


Second part - Week T 


Second part - Week 8 


Friday P.M. 
Saturday A.M. 
Saturday Р.М. 


Compute the analysis of variance on each of the above latin squares. Combine 
the data from the four latin squares into a single analysis of variance. Test 
all meaningful hypotheses and state your conclusions. Compute the standard 
error of a treatment mean from the combined analysis. Also, compute the co- 
efficient of variation and the hsd. 


LITERATURE CITATIONS 


Bachér, I., Nordisk Jordbrugsforskning 15:220, 1933. 
(5x5 1.s.) 


Beckett, W. H., Trop. Agr. 9:7, 1932. 
(% treatments in а lixl 1.5.) К 
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Bliss, C. I., and Marks, H. P., Quart. J. Pharm. and Pharm. 22:82, 182, 1959. 
(l dates of insulin injection, + dosages, and 8 rabbits in two 4x4 1.5.; 8х8 
1.5. on 4 dosages of insulin on two days) 

Bose, В. D., Agr. and Livestock in India 2:605, 1932. 

(4 barley varieties in а hxl 1.5.) 


Bose, S. S., and Mahalanobis, P. C., Sankhya 4:103, 1938. 
(4xk l.s. with 2 mixed-up yields; two xk 1.5. with 2 mixed-up yields) 


Cochran, М. G., Emp. J. Exp. Agr. 6:157, 1938. 
(5x5 1.s. to estimate effect of 5 soil fumigants on number of wireworms) 


› апа Watson, D. J., Emp. J. Exp. Agr. 4:69, 1956. 

(6x6 1.s. used to control and measure observer's bios in estimating plant 
height; rows = order, columns = areas and treatments = observers 
Dominick, B. A., Jr., Unpubl. Ph. D. thesis, Cornell Univ., 1952. 
(Several examples on marketing apples) 

Fisher, В. A., and Wishart, J., Imp. Bur. Soil Sci., Tech. Comm. 10, 1950. 
(5 fertilizer treatments on barley in & 5x5 lis.) 

Gaines, J. C., J. Econ. Ent. 50:119, 1957. 

(6 dusting treatments on cotton in two 6x6 1.s.) 


Garrett, Н. E., and Zubin, J., Psychol. Bul. 40:233, 1943. 
(Four orders of presentation, four illumination levels, and four colors were 


presented to subjects; ах l.s.) 

Gilbert, S. M., E. Africa Agr. J. 4:131, 1938. 

(4x4 1.5. on coffee trees) 

Hansberry, T. R., and Richardson, C. H., Iowa State College J. Sci. 10:27, 


1955. 
(4 spraying treatments on apple trees ina hl 1.5.) 


Hansford, C. G., et al., Ann. Appl. Biol. 20:hoh, 1955. 

(% seed treatments on cotton in а lxh 1.5.3; several characters) 

Harrison, C. J., et в1., Sankhy& 2:55, 1955. 

8x8 1.6, on effect of 8 manurial treatments, B dates, and 8 methods of manufac- 
turing on quality of tea) 

Inman, W. R., et al., Sci. Agr. 20:33, 1939; 22:18, 1941. 


(4 feeding rations in & lx} 1.6. with the rows being different foxes and the 


columns periods) 
Jackson, F. K., and Wad, Y. D., Agr- 
(5 cotton varieties in а 5x5 1-5.) 


Johnson, S. T., J. Agr. Sci. 19:511, 1929. 
(5 manurial treatments on sugar beets in a 5x5 1.5.; 2 examples) 


Kerr, Н. W., Emp. J. Exp. Agr. 2:20, 1954. 
(5 fertilizer treatments on sugar cane in a 5х5 l.s.) 


Ladell, W. В. 5., and Cochran, W. G., Ann. Appl. Biol. 25:541, 1938. 
(5x5 1.s. with rows laid end to end on 5 soil treatments for controlling 
wireworms ) 

Li, H. W., etal., J. Am. Soc. Agron. 28:1, 1936. 

(5x5 Sis C different spacings on millet) 

Mahalanobis, P. C., Indian J. Agr. Sci. 2:157, 1932. 

(13 rice parents ana their hybrids in а 15х15 1.в.) 

Main, W. R., and Tippett, L. H. C., Shirley Inst. Mem. 18:109, 1942. 


xl l.s. on the weaving of cotton cloth) 


Maskell, E. J., Trop. Agr. 6:5, 1999. 
l fertilizer treatments on potatoes in а аЬ 1.8.) 


and Livestock in Indie 5:211, 1955. 


12 THE LATIN SQUARE DESIGN 
Paterson, D. D., Trop. Agr. 10:267, 1955. 
(3x3 1.5.) 
Rothamsted Exp. Sta. Report 1925-1956. 
(Numerous examples on various crops and with various treatments) 
Taylor, S. A., Unpubl. Ph. D. thesis, Cornell Univ., 1919. 
(Several examples) 
Turner, P. E., Trop. Agr. 9:155, 177, 1932; 10:60, 1933. 
(Four 3x3 1.5. on sugar cane manurial treatments; 10 fertilizer treatments on 
sugar cane in а 10х10 1.5.; two 5x5 l.s. on sugar cane) 
› and Potter, J. A., Trop. Agr. 9:44, 1932. 
(5 fertilizer treatments on sugar cane in two 5x5 1.s.) 
Ward, б. M., and Smith, V. R., J. Dairy Sci. 32:17, 1949. 
(5 times of milking as they affect total yield of milk in a 5x5 l.s.) 
Watson, C. J., et al., Sci. Agr. 20:175, 258, 1959; 20:458, 1940; 22:561, 190; 
25:108, 1943; 29:265, 400, 1949. x 4 2 РЕЧ дз; 
(Several numerical examples of feeding treatments with ruminants; second and 
fifth papers have some missing plots as the animals refused to eat the hay) 
Wishart, J., Imp. Bur. Plant Br. and Genetics, 1940. 
(4 varieties of sugar beets in a 4x4 1.5.) 
—, and Sanders, H. G., Emp. Cotton Growing Corp., London, 1936. 
(5 fertilizers on wheat in a 5x5 1.в.) 
Wood, В. C., Emp. J. Exp. Agr. 1:516, 1955. 
(5 varieties of yams in a 5x5 l.s.) 
Yates, F., J. Agr. Sci. 26:501, 1956. 
(5 fertilizers on sugar beets in а 5x5 l.s.; gives analysis when 1 treatment 
bas been omitted; 6x6 1.s. with one missing column and one missing plot) 


, and Hale, R. W., J. Roy. Stat. Soc., Suppl. 6:67, 1939. 
(7x7 l.s. with 2 missing treatments) 


, and Watson, D. J., Emp. J. Exp. Agr. 2:174, 195h. 
(10 observers counting plants and tillers in а 10х10 1.5.) 


Youden, М. J., Contr. Boyce Thompson Inst. 11:207, 1940. 
(5x5 1.5. on soybeans with 5 seed treatments; one column missing) 


› and Beale, Н. P., Contr. Boyce Thompson Inst. 6:437, 1934. 
(5 leaf inoculation treatments of virus on tobacco in a 5x5 1.5.) 


Young, D. M., and Romans, R. G., Biometri¢s 4:122, 1948. 
(3 numerical examples of 4 insulin injections into rabbits (columns) on differ- 
ent dates (rows)) 


In addition to the above numerical exemples, others may be found in various 
textbooks; e.g., Baten, Mathematical Statistics (pp. 259, 261); Edwards, Experi- 
mental Design in Psychological Research (pp. 307, 321, 327-332); Fisher, Statis- 
tical Methods (ех. 46, 46.1); Fisher, Design of Experiments (р. 88); Goulden, 
Methods of Statistical Analysis (ex. 34); Hayes and Immer, Methods of Plant 
Breeding (ch. XX); Leonard and Clark, Field Plot Technique (pp. 174, 10h); 101% 
Application of Statistical Methods to Agricultural Research (p. 441); Kendall; 


Advanced Theory of Statistics, vol. II (p. 258); Love, Experimental Methods in 
in 


Agricultural Research (pp. 51, hh, 86, 9h, 112); Mather, Statistical Analysis 
Biology (p. 98); Paterson, Statistical Techniques in Agricultural Research (pp 
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169, 173); Snedecor, Calculation and Interpretation of the Analysis of Variance 
and Covariance (ex. 5, 12); Snedecor, Statistical Methods (pp. 271, 275, 42k, 
L46, 448). 


CHAPTER ҮТТ`- THE CHOICE OF TREATMENTS AND THE FACTORIAL EXPERIMENT - р" SERIES 
PROBLEMS 


1. Six replicates of a randomized complete block design were used to compare 
nine treatments. Eight of the treatments recieved a lime application in addi- 
tion to one of the eight combinations of three factors, n, p, ane. k each at two 
levels. Thus, the nine treatments represent a 2? factorial arrangement of the 
factors n, p, and k plus a check. Do you agree with the following breakdown of 
the total degrees of freedom? Why or why not? 


Source of variation 
$$$ 


аг 
Replicates 5 
Treatments 8 
Ch. vs others 1 
N 1 
Р; 1 
K 1 
NP 1 
NK 1 
РК 1 
NPK 1 
Error E ho 
Total 55 


What coefficients with what signs are required for the above comparisons? 
Answer by completing the following table: 


ch. vs others 


If E = error mean Square, what is the standard error of difference for the 


comparison of lime versus no lime? What is the lsd for an effect total? for 
an effect mean? 


2. The following data аге of the same nature as described for the experiment 
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6. А 21 factorial experiment on fert 


PROBLEMS 3 


discussed in example VII-2 except that the 48 boys were asked a series of ques- 
tions on stereophotographic pictures. Complete the analysis of these data as 
described in example VII-2. Give the combined analysis for the data on stereo= 
photographic and on non-stereophotogrephic pictures. 


к‚юеюон|он‚ою о 


o 
4 
4 
1 
2 
o 
If 
10 
2 
2. 
m 
2 


9 
2 
2 
1 
2 
8 
о 
1 
4 
2 
2 
2 


5 + 
5. Set up а table in which all effects ina 2 factorial are zero except ABC 


Start with the fact that ell levels of effect totels are equal to lO, and then 


Set up the general case where all levels of effects are equal to a constant, 


зау k, and the yield of treatment 000 15 x. 
I-1 into its three component 


4. Partition the error sum of squares in table VI 
Discuss the results. 


Parts. Test the homogeneity of the three variances. 


1.4 calculate the 95% confidence intervals 


5. Using the method of section 11-1. 
ans for the data of example VII-2. 


for all differences among pairs of me 
сега is to beset up in ап established 


Orchard. It is planned to use individual tree plots, to use 96 trees for the 
experiment, and to use a randomized complete block desiga. Discuss the flaws 
in such experiments, the likelihood of errors, апа the selection of the exper- 
imental trees. Give the experimental plan, including the randomization proce= 


dure and the key-out for the degrees of freedom in the experiment. 


n 
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LITERATURE CITATIONS 
Brady, J., J. Roy. Stet. Soc., Suppl. 2:99, 1935. 
(5 oat varieties x 5 spacings in а 9x9 aes) 


Cochran, W. G., Biometrics 5:22, 1947. n 
(3 levels of chalk x 5 levels of lime plus 2 controls in 4 replicates; 2  fer- 


tilizer treatments on pyrethrum) 
Darroch, J. G., et al., J. Animal Sci. 9:431, 1950. 


(Discussion and plan for a 2? 
data not given) 


de Verteuil, J., Trop. Agr. 11:515, 1934. 


(Check plus 2? fertilizer treatments on coconuts in 4 replicates; 4 years! 
data) 


factorial feeding experiment on sheep; original 


Eden, T., and Fisher, R. A., J. Agr. Sci. 17:548, 197. 


(2 levels of nitrogen x 2 levels of sulphate x 2 times of application plus h 
checks in 8 replicates) 


Houghland, б. V. C., апа Strong, М. O., J. Am. Soc. Agron. 55:189, 1941. 
(25 fertilizer treatments on potatoes in 6 replicates for 5 years) 

Jacob, W. C., Brazilian Ministry Agr. J. 33:41, 190. 

(o in 4 replicates and 3 in 5 replicates) 


Mahalanobis, P. C., Indian J. Agr. Sci. 2:69}, 1932. 
(3x3 factorial of potato varieties and fertilizers in 5 blocks) 


Maskell, E. J., Trop. Agr. 6: 45, 97, 1929. 

(Discusses a 2x3x2 and а 4x4 factorial, and quality x quantity interaction) 
Naik, K. C., Indian J. Agr. Sci. 9:651, 1939. 

(Several examples similar to VII-1) 

Potter, G. F., Proc. Am. Soc. Hort. Sci. 50:443, 1947. 

(2х2 fertilizer factorial on tung trees in 5 blocks) 


Singh, R. N., Indian J. Agr. Sci. 12:588, 19l2. 


(3x5 factorial of distances from leaf and pressure in spraying for woolly 
aphis in 4 blocks) 


Wishart, J., and Hines, Н. J. G., J. Ministry Agr. 36:524, 1929. 
(3 nitrogen x 3 potash factorial on potatoes in 9 replicates) 
Yates, F., Emp. J. Exp. Agr. 1:129, 1933. 

(2? factorial in 10 replicates with 9 missing values) 

——, Imp. Bur. 8011 Sci., Tech. Comm. 25, 1937. 


(2? fertilizer factorial on potatoes in 4 replicates of a randomized complete 
block design) 


In addition to the above citations, numerical examples may be found in various 


textbooks; e.g., Cochran апа Cox, Experimental Designs (ch. 5), and Snedecor, 
Statistical Methods (sec. 15.9). 


CHAPTER VIII - OTHER FACTORIAL EXPERIMENIS 

PROBLEMS 

1. For the data in example VIII-1 compute the standard errors on а mean per 
plot basis. Convert the figures for means and standard errors to tons per acre. 


On the assumption that Sword Bean апа Sunnhemp have something in common make 


unnhemp vs Sword Bean end Sunnhemp + Sword Bean vs 2 Woolly 


the comparisons 5 
two contrasts. 


Pyrol. Also, partition the interaction sum of squares into the 
Plot the results. 


2. Using the model of section II-1.1.4 set 95 percent confidence intervals on 
the variety means and on the nitrogen means. 


5. Leave out 
out the treatments ago 8101, ага 8555 in the 3x3 factorial in ex- 


ample VII-5 ага compute е апа Scr ction V = - 
ute th lysis described in sectio 
III-5.5 on the re 


* Del 1 tre 5 e 
4 ete all treatment: 091 о?2 122 
xcept aP]? ajb,, and a,b, in example VII-3 and carry 
out the analysis proposed in section VIII-5.4. 


5. Verify that the variances and covariances for the G,, бу, апа Ой, are as 
Stated in section VIII-5.1. : " 


LITERATURE CITATIONS 


Autrey, К. M., etal., J. Dairy Sci. 30:775, 197. 
(3 forage crops x 7 Wilting treatments in 10 replicates; silage) 


Bell, б. D. H., etal., J. Agr. Sci. 32:255, 1942. 
(3 potato varieties x 5 cutting treatments in 6 replicates) 


Bliss, C. I., J. Am. Stat. Assoc. 35:498, 1940. 
(3x2 factorial in a nutritional investigation in 6 replicates) 


pects A. E., J. Am. Soc. Agron. 29:658, 1937. 
5 x 2 levels of chemical A x 2 levels of chemical В x 2 levels of 
ical C in 2l flasks) i а 


р] 
Кр F., etal., J. Am. Soc. Agron. 5:521, 1942. 
rtilizers x 3 methods of placement in 5 replicates) 


Dur 
{ oe and Stuart, A., J. Roy. Stat. Soc., A 114:163, 1951. 
E. X 2^ factorial in Т replicates on response errors in personal interviews) 
(Суз) R., Ia Foret, Quebec 2:31, 1940. 
fertilizer factorial) 
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18 OTHER FACTORIAL EXPERIMENTS 


Griswold, R. M., and Blakeslee, L. H., Proc. Am. Soc. Animal Prod. 22:205, 

1959. 

(7 wrappings x 5 temperatures x 2 storages with 11 characteristics on frozen 
pork chops) 

Harrison, C. J., and Bose, S. S., Sankhya 6:151, 1932. 

(2 prunings x 4 Jets in an 6x8 1.s. on tea) 

Hoblyn, T. N., Imp. Bur. Fruit Prod., Tech. Com. 2, 1931. 

(3 strawberry varieties x 4 dates of planting in 4 replicates) 

Klingebiel, A. A., and Brown, P. E., J. Am. Soc. Agron. 30:1, 1938. 

(2 inoculations x 4 levels of limestone in 24 pots) 

Peh, 5. С., J. Am. Soc. Agron. 29:167, 1937. 

(3 rice varieties x 7 numbers of seeds per row in 6 replicates; + varieties x 
7 numbers of seeds in 6 replicates; 5 numbers of plants per hill x 6 distances 
between hills in 10 replicates) 


Snédecor, G. W., J. Am. Stat. Assoc. 31:690, 1936. 

(4 cows x 2 feeds x 3 periods) 

Tharp, W. H., et al., Phytopathology 31:26, 1941. 

(3 nitrogen x 2 potash x 2 phosphorus x 3 cotton varieties in } blocks) 


Wilcoxon, F., Am. Cyanamid Co., Stamford, Conn., 1949. 
(2x2x3 factorial on seed treatments on corn in 3 replicates) 


Wood, R. C., Emp. J. Exp. Agr. 1:316, 1933. 
(Several factorials on yams) 


In addition to the above citations, numerical examples may be found in various 


textbooks; e.g., Anderson and Bancroft, Statistical Theory in Research (ch. 


20); Cochran and Cox, Experimental Designs (ch. 5); and Snedecor, Statistical 
Methods (ch. 11). 


CHAPTER IX - CONFOUNDING IN FACTORIAL EXPERIMENTS 
PROBLEMS 


1. What design would be most appropriate given the following information? 
Nine roasts can be cut from each of 12 animals. The experimenter wishes to 
Study the effect of freezing and storage temperatures and length of storage 
upon tenderness of roasts and to use the following in all combinations: 


storage temperatures - О, 10, and 15 degrees, 
| freezing temperatures  - О, -10, and -20 degrees, and 
length of storage - 30, 90, ага 180 days. 


Give the breakdown of the total degrees of freedom and set up the design. Give 
the design and analysis for six roasts from each of 12 animals with two freez- 
ing temperatures, O anà -20. 


2. The following is a field plan for a 2? factorial experiment. What effects 
&re confounded in each of the replicates? 


E 
1 
П 
1 
1 
D 
t 


Write out the subdivision of the total degrees of freedom in the analysis of 
Variance, summary tables, and adjustments for treatment totals. 


5. For a 2? factorial experiment with four replicates give the appropriate 


Subdivision of the total degrees of freedom, the design, the summary tables, 
and the adjustments for the treatment means for: 
(i) а randomized incomplete block design in which the AB 
interaction is completely confounded in а11 replicates; 
(i1) a randomized incomplete block design in which the 
ABC effect is confounded in replicate 1, 
BC effect is confounded in replicate 2, 
AC effect is confounded in replicate 5, and 
AB effect is confounded in replicate 4. 
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20 CONFOUNDING IN FACTORIAL EXPERIMENTS 
lh. Compute the sums of squares for the linear ага quadratic components for the 
factor a in example 1Х-5 and partition the sums of squares for the AB, AC, and 
ABC effects into single degrees of freedom. Is there апу change in the inter- 


pretetion of the effects? Compute the standard error for each of the 11 com- 
parisons. 


5. Show algebraically that the factor 3/4 is the correct factor to use in ex- 
ample IX-2 for obtaining the adjusted effects for the interactions which are 
partially confounded with incomplete block differences. 


6. Suppose that the B effect was partially confounded with incomplete block 
differences in 2 replicates and that the AB effect was confounded in the other 


2 replicates of a 22 factorial in incomplete blocks of 2 experimental units. 


Give the algebraic formulae for the adjusted effect totals obtained from the 
adjusted treatment totals. 


T. Given the following experimental lay-out: 


Variety у, Variety MI 


о 
Seed size E Seed size Sy Seed size 50 


А2 х 3? factorial is arranged at random within each of the 4 rectangles. Key 
out the degrees of freedom for each rectangle and for all rectangles combined. 


Indicate the components constituting the experimental error and the assumptions 
involved. 


Seed size s o 


8. Discuss confounding in experimental work. Illustrate with exampies. 


9. Discuss the factors determining the number of replicates used in experi- 
mental work. What factors affect the Precision of comparisons among treatment 
means in a replicated experiment? 


LITERATURE CITATIONS 


Eden, T., and Fisher, R. A., J. Agr. Sci. 19:201, 1929. 
(5? factorial in incomplete blocks of 9) 

Finney, D. J., Ann. Eugenics 12:291, 1945. 

(1/2 replicate of a h x 2' factorial on potatoes) 

— —5 J. Agr. Sci. 36:184, 1946. 

(1/2 replicate of a 2° in blocks of 8) 

Fisher, R. A., and Wishart, Jey 


Imp. Bur. Soil Sci. x . 10, 1920. 
(2x2x4 end а 2xh in incomplete blocks of 12) 77 TOch. Comm. 10, 


LITERATURE CITATIONS * 21 
Klingebiel, A. A., and Brown, P. E., J. Am. Soc. Agron. 29:9hh, 1937. 
(2 crops x 4 applications of limestone x 2 regions) 


Ma, R. H., and Kao, L. M., Emp. J. Exp. Agr. 8:23, 1940. 
(5 varieties of rice x 5 dates x 5 spacings with partial confounding of the 3 
factor interaction) 


Rothamsted Exp. Sta. Report, 1955. 
(Several examples of confounding) 


Snedecor, б. W., Biometrics 4:211, 1948. 

(2? factorial in incomplete block of 2 treatments) 

Tischer, R. G., anà Kempthorne, O., Food Tech. 5:200, 1951. 

(Analysis only for a 1/9 replicate of a 31 in incomplete blocks of 27) 


Turner, P. E., Trop. Agr. 12:295, 320, 1935. 
(ln x 2k x 2p x 2ca in incomplete blocks of 16 in sugar cane in ! replicates) 


Vaidyanathan, M., and Subramonia Iyer, S., Indian J. Agr. Sci. 10:215, 1940. 


(3? treatments on sugar cane in incomplete blocks of 9; 5' treatments on sugar 
cane in incomplete blocks of 9) 


Wishart, J., J. Agr. Sci. 28:299, 1958. 


(25 factorial on asparagus uniformity trial data in incomplete blocks of 4 with 
3 factor interaction completely confounded) 


— >» Imp. Bur. Plant Br. and Genetics, 1940. 
(2? (п, р, К) factorial on asparagus in 4 replicates; ИРК completely confounded) 
Yates, F., J. Roy. Stat. Soc., Suppl. 2:181, 1955. 


(2? factorial on peas with NPK completely confounded in 3 replicates; 3? in 
blocks of 9 plots with one replicate on sugar beets) 


———, Imp. Bur. Soil Sci., Tech. Comm. 35, 1937. 
(5? in incomplete blocks of 9; 3x2x2 in incomplete blocks of 6) 


In addition to the above citations, numerical examples may be found in various 
textbooks; e.g., Cochran and Cox, Experimental Designs (ch. 6); Fisher, The De- 
Sign of Experiments (sec. 52-h); Goulden, Methods of Statistical Analysis, 2га 
ed. (ch. 12); Immer, Applied Statistics (ch. 16); Kempthorne, The Design апа 
Analysis of Experiments (sec. 13.4, 14.2, 14.7, 15.8, 16.5); and Leonard ага 
Clark, Field Plot Technique (ch. XIX). 


CHAPTER X - FACTORIAL EXPERIMENTS WITH MAIN EFFECTS CONFOUNDED - SPLIT PLOT AND 
SPLIT BLOCK DESIGNS WITH VARIATIONS 


PROBLEMS 


1. 1943 yield data were obtained on the same six corn hybrids as used in exam- 
ple X-2. The yields in pounds of ear corn from a 2 x 10 hill plot for the six 
hybrids from four replicates of a randomized complete block design are given below 
below in a systematic arrangement of replicates and varieties for Districts I 

and II. Compute the analysis of variance for these date. Compute the analysis 

of variance for years 1942 and 1943 for each district and obtain the combined 
analysis over both districts and years. Set 95 percent confidence intervals on 
the differences for pairs of means. Give an interpretation of the results. 


Table for Problem 1 


Systematic arrangement of replicates and varieties 
of corn hybrid yield data for Districts I and II for 1943. 


Doublecross 


Replicate number 
designation 


т тї III 


Doublecross 
totals 


Iv 


District I 


al Sat 57.5 56.6 3h.9 25.2 142.2 
2-2 37.2 37.0 34.5 33.8 142.5 
-3 28.7 32.2 31.0 28.5 120.4 
15 - 43 34.7 32.7 31.0 30.7 129.1 / 
8 - 38 57.0 37.0 36.6 150.9 


T - 39 


35.2 32.8 


Total 208.7 201.2 

District II 
1-1 24,2 27.7 26.5 25.4 103.8 246.0 
2-2 28.7 51.3 27.4 29:5 116.7 259.2 
kn 5 25.8 24.4 19.9 20.5 90.6 211.0 
15-5 ов ЕКО В METEO 237.5 
8 - 58 28.5 26.2 27.5 24,9 106.7 257.6 


28.1 29.6 
160.6 171.1 


26.2 25 108.4 
156.1 


240.1 
1451.2 


Total 


l 
l 
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2. The folloving design of an experiment was used in an orchard: 


a a 


o 1 $i 


Rep I Rep II Rep III Rep IV 


The ao апа ay treatments were systematically arranged in all four 
Likewise, the bo and by treatments were arranged in the manner shown by design 


and not by chance. The 89 treatment represents no dusting treatment and а. 


at 
represents dusting treatment. The do and bi treatments (fertilizers) were 


arranged in the manner shown because the experimenter wish 
resentative", 


replicates. 


ed them to be "гер- 
The character studied is pounds of fruit per tree. What effects 
are confounded and what effects are unconfounded? Is it correct t 
test to compare the dusting mean Square, the fertilizer mean Square, or the 
interaction mean Square? Why or why not? Also, it was argued that no other 
design coula have been used since the fertilizer and dusting applications were 
made in a farmer's orchard and since oné must use the farmer and regular farm 
Machinery for these operations. Do you agree or disagree on the reasons for 
designing the experiment in this manner? Why? 


О use the F 


3. Ina grazing experiment with 2l dairy cows two replicates in a pasture were 
available; 


it was desired to study the effect of three grazing treatments on 
equal areas of land — continuous grazing, 12 days of no grazing апа then 12 
days of Erazing, and two days of grazing and then two Says of no grazing = with 
four cows per grazing treatment. In addition, two of the four cows on each 
@azing treatment were to be fed concentrates and the other two were not. The 
Yiela Character studied is pounds of butterfat 
Sign for such ап experiment conducted over a 
degrees of freedom in the analysis of varianc 


mean squares for testing the variation among 
Pothesis stated by you? 


per cow per year. Set up a de- 
three year period and key out the 
е. What are the correct error 
the various means under the hy- 


LN *andomized complete block design of 2% treatments in 5 replicates is 
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used. Since the experiment (with the ваше treatments) is conducted at differ- 
ent locations several miles apart, or in different years at the same location, 
but on different parts of the experimental area: 

(i) is it necessary to rerandomize for each location and for each year, 

(ii) is it advisable to rerandomize, and i 

(iii) why? 


5. The following analyses of variance are from 3 locations. The treatments 
are the same at all 3 places: 


Location I Location II Location III 

af ss af ss af ss 
Treatments 9 T 9 T 9 Ts 
Replicates 2) Ry 4 Ry 5 в, 
Error 27 Ej 36 E, h5 Е, 
КОЕ... е — ВЕЕ МОРЕ 5 8 
Total 

59 G 49 с, 59 с, 


What is the method of computation and breakdown of Gegrees of freedom for the 
combined analysis? How does one compute & least’ significant difference for 
place or location means; i.e., what is the n in the formula 


2 
/ т, error mean square 7 


Would such а statistic have any usefulness? Explain. 


6. Ten fertilizer treatments, A, B, C, D, E, F, G, H, I, anà J were tested in 
4 randomized complete blocks on cabbage. 


The following arrangement of plots 18 
for one of the replicates: 


The 
е replicate was divided into two parts across all fertilizer plots. Then two 
applications of DDT dusting (none ana dusting) were made 
J 


the two 
halves of the plots; thus at random, to 


Is € 
E ta way to calculate the interaction of fertilizer treatments and 
€ ч ЯП of DPT dust? Key out the degrees of freedom for this design and 
са: 
© appropriate error mean squares for making the various comparisons e 


T. An experiment was conducted on lh fertiii 


ty levels p, апа} 
levels of plant populations, a, y » А, В, C, and D, 


b, c, ani d. The cro; orn. Thi 
p was sweet corn e 
following field arrangement of a lil à 
a х latin Square with the split plots order 


PROBLEMS 


ro 


in each column was used: 


5 


The remaining columns аге likewise Subdivided but with a different randomiza- 


tion. What is the analysis, taking into consideration the latin square 
arrangement of the split or sub-plots? 


8. Four fertilizer treatments on peas were tested in 5 rendomized complete 


blocks in 1946 and in 4 blocks in 1948. The following breakdown of the total 


degrees of freedom and sums of squares was obtained for the individual years: 


1946 1948 

Source of variation аг ss af ss 
Replicate 2 A 5 G 
Treatment 5 B 3. H 
Error = replicate x 

treatment 6 g 9 z 
Total 11 D 15 J 
Correction for mean eB E x K 
oe E 
Total uncorrected ss. 12 F 16 L 


9. 


Row I 


Row II 
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The field design presented above of a bx} latin square superimposed оп a ixl 
latin square was made (tne arrangement is systematic here but was random in the 
field). The treatments а, b, c, and d in column I are laid out in & systematic 
manner over the plots containing varieties A, B, C, and D in column I; i.e., 

treatment & continues across the ! rows of the latin square containing the 4 

varieties, A, B, C, апа D. Treatments a, b, c, and à are erranged so that each 

one occupies each of the Ah orders within the columns. What is the breakdown of 

the degrees of freedom in the analysis of variance for such а design? 


10. Give the diagrammatic lay-out and the key-out of the degrees of freedom 
for 2? whole plot treatments in an 8х8 latin square design with split plots of 
a 22 factorial set of treatments allotted at random within each whole piot. 


Give the design and key-out of the degrees of freedom taking into account the 
order within whole plots. 


Gesigns. 


Explain the randomization procedure used for both 


11. R. E. Blaser (Virginia Polytechnic Inst., 1950, unpublished results) con- 
dücted ап experiment on 5 brome grass strains (a, b, d, e, f) both alone ага 
with alfalfa. The 2 whole plots (with and without alfalfa) were laid out in a 


randomized complete block design in 4 replicates. The 5 brome grass strains 


represented the split plots and were randomly allotted to the plots within each 
whole plot. The following yields (ary weight in grams ) were obtained: 


Strain of With With With With 
brome alfalfa Alone | alfalfa Alone | alfalfa Alone 


alfalfa Alone 
a 
b 
a 
e 
f 


4742 6176 5302 6129 


Complete the analysis anà write an interpretation of the results. 


LITERATURE CITATIONS 


(ео R. L., Biometrics 2:41, 1946. 
replicates of а r.c.b. design with 4 whole lot; 4 
procedure for estimating a missing split plo аааз oe 


t or a missing whole plot) 
Arceneaux, G., and Herbert, L. P., J. Am. 5 T. 
(6 varieties of sugar cane in a 6x6 l.s Hue oar aot аду, 


for 4 ye: па 4 ions? 
not give original yields but do give the individual 4 аре. М 
Bachér, I., Beretn. Nordisk Jordbrf. 


Hefte: LI 1955. orsker. Foren: 5th Kongr. Kfbenhavn 4-7; 
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(Split plot and more complicated designs) 
Bartlett, M. S., J. Roy. Stat. Soc., Suppl. 3:68, 1936. 


(Control of cockchafer larvae, 5 treatments in whole plots, 8 replicates of a 


r.c.b. design, split plots are two age groups of larvae; transformation of 
data) 


Bescoby, H. B., J. S. E. Agr. College, Wye, Kent 30:215, 1932. 
(4 kinds of nitrogen on barley аге the whole plots in а Жл l.s.; 2 levels of 
phosphorus x 2 levels of potash are the split plot treatments) 


Burkett, A. L., ef al., J. Am. Soc. Agron. 41:255, 1949. 

(2 replicates of 4 ages x 2 harvests as whole plots and 4 turning methods x 2 
exposure periods as split plots; retting of hemp) 

Chakravertti, S. C. et al., Indian J. Agr. Sci. 6:34, 1956. 

(Split split plot design of 5 sowing dates for rice with sub-units of 5 varie- 
ties with each variety plot being subdivided into 5 spacings x 5 numbers of 
seedlings per hole) 

de Verteuil, J., Trop. Agr. 11:515, 195%. 

(Check anà 2? fertilizer treatments on coconuts in r.c.b. designs for 5 years) 


Ensign, R. D., et_al., Mimeo report, Colo. A and M, Fort Collins, 1947. 

(12 pinto bean varieties in 6 replicates of a r.c.b. Gesign at two places; 

6 fertilizer treatments in h replicates of а r.c.b. design at two places) 
Finney, D. J., J. Agr. Sci. 36:56, 65, 19h6. 

(4x4 l.s. with 4 manurial treatments as whole plots; each whole plot was split 
in two and either n and k added or (1) and nk to the two plots; potatoes; 

6x6 l.s. similar to the above; 12 treatments of organic manures as whole plots 
with each plot split in two and either n and К added or (1) and nk; carrots) 
Fisher, R. A., Biometrics 5:500, 1949. 

(Split split plot of 10 farms with 12 cows per farm with the 12 cows being 


divided into groups of 4 for 3 times of reading with 4 treatments or injec- 
tions into the left and right sides of a cow's neck) 


Gould, C. E., and Hampton, W. M., J. Roy. Stat. Soc., Suppl. 5:157, 1956. 
(2 pots with 5 runs per pot with 5 journeys and 3 cylinders 


per run per pot of 
glass) 

Gregory, F. G., etal. J. Agr. Sci. 22:617, 1932. 

(Split split plot of 4 dates of sowing cotton; the first split was for 5 
spacings x 3 irrigations; the second split was for manure and no manure) 

Harrington, J. B., Sci. Agr. 21:589, 1941. 

(2 varieties each of oats, barley, and wheat in a 6x6 1 
3 spacings of rows carried on for 4 years) 


Immer, F. R., J. Am. Soc. Agron. 34:844, 1942. 
(2 whole plots with 4 varieties or crosses per whole plot in 6 replicates) 


——, etal., J. Am. Soc. Agron. 26:03, 1934. : 
(3 replicates and 10 barley varieties grown for 2 years at 6 stations 
Jensen, N. F., Cornell Univ. Agr. Exp. Sta. Mem. 305, 1951. 

(2 dusting treatments, 8 replicates, 7 barley varieties) 

Jodon, N. E., and Beachell, H. M., J. Am. Soc. Agron. 30:212, 1938. 

16 whole plot treatments of 4 rates of seeding x 4 varieties in 4 replicates 
with 4 subplot treatments on spacings and rows at 2 locations) 
Mitra, M., Indian J. Agr. Sci. 7:459, 1937. 

б seed treatments per each of 6 replicates with 3 wheat varieties per seed 
treatment) 

Ostle, B., Biometrics 5:71, 19h9. 

(6 bulis equals whole plots with split plots in a 3x3 1.5. composed of 3 


+8. with split plots of 
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cóllections per bull = rows, 5 dilution rates = columns, and 5 groups of in- 
semination = treatments) 
D. D., J. Agr. Sci. 25:615, 1955. 
СО; dates on 5 A oer on tropical fodder grasses in 8 replicates) 
Trop. Agr. 10:267, 303, 346, 1955. 
(2 TE ае чс in 4 replicates for 3 years; 2 methods of cultivation on 
corn in 2 replicates with 3 rates of seeding as the split plot) 
and Hanschell, D. M., Trop. Agr. 15:199, 1938. 
(5 TERRE of Sugar cane in 5 replicates with 2 spacings as the subplots) 
Peterson, M. L., J. Am. Soc. Agron; 39:412, 1947. 
(2 grazing treatments in.5 replicates for 5 years) 


Quinby, J. R., et al., J. Am. Soc. Agron. 29:269, 1937. 
(19 cotton varieties in ! replicates for 3 years) 


Rau, A. A., Unpubl. M. S. thesis, Iowa State College, 1948. 

(Perennial experiment on coffee) 

Rothamsted Exp. Sta. Report, 1927-1934. 

(Several examples of split plot designs and variations) 

Russell, C. S., Biometrika 35:215, 1945. 

(2 readings on each of several patients on measurement of puerperal uterus) 


Russell, G. A., and Little, V. A., J. Am. Soc. Agron. 38:646, 1946. 


(2? fertilizers in 4 replicates with 3 dates of harvest from plot on devil's 
shoestring) 


Salisbury, G. W., et al., J. Animal Sci. 1:199, 190. 
(Two times on 4 bulls with 4 methods of making semen smears) 


Sen, H. D., Indian J. Agr. Sci. 10:172, 190. 
(9 mea as whole plot in 6 replicates with 5 sugar cane varieties as sub- 
plots 


Singh, M., Emp. J. Exp. Agr. 18:190, 1950. 
fouet in split plot designs; 
3^e and 5 x2 on potatoes) 
Snedecor, G. W., апа Haber, E.'S., 
(6 replicates of 5 cutting dates on asparagus for 10 years) 
Taylor, 5. A., Unpubl. Ph. D. thesis, Cornell Univ., 1949. 
(6 treatments of soil aeration in 8 replicates at 2 dates) 


Thomas, W. D., et al., Colo. Agr. Exp. Sta. Misc. Series 435, 1949. 
(4 replicates of 9 pinto bean varieties at 3 dates) 


Turner, P. E., Trop. Agr. 12:320, 1935. 
(8x8 1.s. on 2x4 fertilizer treatments with each row of the 1.5. divided 
into 5 strip plot treatments) 


Biometrics 2:61, 19h6. 


Tysdal, Н. M., J. Am. Soc. Agron. 27:384 
(5 alfalfa varieties in 3 replicates for 4 years) 

Watson, S. J., and Ferguson, W. S., J. Agr. Sci. 27:67, 1937. 

(2 silos with 4 loads per silo ага 3 samples per load per ЖУ 
Wenger, L. E., J. Am. Soc. Agron. 33:135, 1941. 

(5 seed soaking treatments on buffalo grass with 3 dates of planting) 
Wilm, H. G., Biometrics 1:16, 1945. 
(5 treatments in 4 replicates for 3 years) 
Wishart, J., and Clapham, A. R., 


1935. 
i 955 


J. Agr. Sci. 19:600, 1929. 
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(9 whole plot treatments on potatoes with 5 replicates and 2 sub-plot treat- 
ments) 


, and Hines, Н. J. G., J. Ministry Agr. 36:524, 1929. 
(4 fertilizers on meadow hay in Ух! 1,5. for 5 years) 


» and Sanders, H. G., Emp. Cotton Growing Corp., 1936. 
(6 replicates of 5 mechanical treatments with 4 manurial treatments as the sub- 
plots) 


Wood, В. C., and Paterson, D. D., Trop. Agr. 9:14, 1932. 
(15 varieties of sugar cane in 4 replicates for 4 years) 


Yates, F., J. Roy. Stat. Soc., Suppl. 2:101, 1955. 


(3 varieties as the whole plot and 4 nitrogen levels as the split plot in 6 
replicates) 


› and Cochran, W. G., J. Agr. Sci. 28:556, 1938. 


(Several locations and several years! summary of 5 varieties in a r.c.b. 
design) 


Youden, W. J., and Zimmerman, P. W., Contr. Boyce Thompson Inst. 8:317, 1956. 


(8 whole plots on 5 fibre pots in a 5х5 l.s. with o? treatments as the split 
plots; tomatoes) 


Numerical examples of split plot designs may also be found in various textbooks; 


е.&.› Goulden, Methods of Statistical Analysis (p. 155); Hayes ага Immer, 
Methods of Plant Breeding (ch. XX); Leonard anà Clark, Field Plot Technique 


(ch. XVIII); Love, Experimental Methods in Agricultural Research (p. 9h); 
Paterson, Statistical Technique in Agricultural Research (pp. 190, 197, 211); 
Snedecor, Statistical Methods (pp. 305, 310, 314, 414); and Yates, Design and 


Analysis of Factorial Experiments (p. Th). 


CHAPTER XI - INCOMPLETE BLOCK DESIGNS - GENERAL CONSIDERATIONS AND THE ONE- 


RESTRICTIONAL LATTICES WITH TREATMENTS IN COMPLETE REPLICATES 


PROBLEMS 


1. 


2. 


3. 


4, 


Give the analysis for replicates II and III in table XI-14. 
Give the analysis for replicates I, III, and IV in table XI-18. 
Give the analysis for replicates I, ІІ, and III in table XI-10. 


Analyze the balanced lattice in example XI-h as a quadruple lattice. Show 


algebraically that the four sdjustments in a Quadruple lattice are equivalent 


to the single adjustment in table XI-19 for k = Py. 

5. Give Rao's method of analysis for the data in examples XI-2, XI-5, апа XI-h. 
6. Analyze examples ХІ-2, ХІ-5, and XI-l by the method described in chapter IX. 
T. Give the analysis for replicates II anà III in table XI-21. 

8. Make up a 2? three-dimensional lattice with 4 and with 5 replicates and with 
k 


both experiments. 


= 2 from the uniformity trial data in table XI-22. Complete the analysis of 


9. The following results on yielà (grams of grain) from 4 replicates of an 
experiment designed as a double lattice were obtained for 16 barley varieties 


(nos. 25 to 40) by Henderson (Virginia Polytechnic Inst. 
results): 


» 1950, unpublishea 


Rep I Rep II 


26 27 28 
2323 2470 2223 2394 


29 30 


25 26 27 
9410 |2579 2767 2329 9641 


51 32 


2959 5560 3248 2687] 12454 um 
55 5k - 35. 56 
3184 5021 269% 2791 11690 3178 2950 11729 
57 58 59 ho 59 
2505. 2819 eMe 2737) 10591 2610 2491 9918 
е 43260 


I 
| 
I 
| 


PROBLEMS 
Rep III Rep IV 
25 29 55 5T 
1205 1559 2080 21145 
26 50 3h 38 
1545 288} 2595 3225 


| 


25 29 35 37 
2009 2569 оло 2474 


26 30 5% 
1951 2158 2644 


27 31 35 39 
1540 2510 3527 5183 
28 ze 56 ho 
1668 2 324 1471 2771 


27 5X 


1497 2258 


28 32 
1781 1808 


Complete the analysis of the data and write a short inte: 
results. 


[ 10. Henderson (Virginia Polytechnic Inst., 1947, unpublished results) conduct- 
ed an experiment on the yield of 12 tobacco varieties. The design was a 3x4 


rectangular lattice in 6 replicates. The following yields were obtained: 


Group X Group Y Group Z 
Se вар or M бане: a 
TIO 560 529 1859 648 352 685 1685 2070 
TE 5 20°08 T 5 12 7 6 
T7 8 9 ал; 2 6 10 3 4 
625 674% hii 1710 682 614 516 1812 615 727 619 2001 
6199 - 6405 7732 
= -*- ИИ 
10! 2331 35 T. өү Or 
тет ТО MES Ts 12 7 6 
70} 629 616 1949 1551 1417 
$ 8 9 1 2 6 10 5 4 
БГ g 12 5 9 11 aui 8 
$69 677 479] 1818 855 658 730| оооу И 
7702 1590 
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Complete the analysis and write a short interpretation of the results. 


11. Henderson (Virginia Polytechnic Inst., 1950, unpublished results) conduct- 
ed a yield test of 12 Burley tobacco varieties. A near balance 3x rectangular 
lattice design with four replicates was used. The following data were obtained | 
for the еаг1у harvesting and for the late harvesting: | 


Early Harvesting 


Block Rep II Block 
totals totals 
1459 1583 
1646 1563 
1588 1620 
2515 1658 
eque Gio 
Block Rep IV Block 
totals totals 
x 1539 
e 1525 

1: 
= 1661 
1385 

154 


late Harvesting 
—ÁÁ 


Block 
totals 


1825 
2179 


2085 


LITERATURE CITATIONS 


Rep III Block 


1781 


1669 
7018 


Complete the analysis for each harvesting. 
results. 


12. Gish (Virginia Polytechnic Inst., 1919, unpublished results) conducted an 
experiment on 9 top dressings of fertilizer on Blfalfa. The design was a 3x3 
balanced lattice. The following yields of cured hay (pounds рег 50 square feet) 
were obtained: 


Reps 


N Foluwa оо 


results. 


LITERATURE CITATIONS 


pests, R. E., Unpubl. Ph. D. thesis, Iowa State College, 1948. 
(8? cubic lattice in 3 replicates on oat varieties) 


Chan-Choong, P. A., Agr. J. British Guiana 10:78, 1959. 
(15х15 Quasifactorial оп a rice variety trial) 


Comstock, R, E., et al., J. Animal Sci. 7:320, 1948. 
(3x3 balancea lattice on swine with littermates formi 


ng an incomplete block) 
Cornish, р, A., Ann. Eugenics 10:137, 1940. 


THE ONE- 
INCOMPLETE BLOCK DESIGNS - GENERAL CONSIDERATIONS AND 
i RESTRICTIONAL LATTICES WITH TREATMENTS IN COMPLETE REPLICATES 


(Illustrates method of estimating missing values in Dawson's cubic lattice and 
Yates! lattice square) 


C. D. R., Ann. Eugenics 9:157, 1939. 
(UU х= 5, е 5, b = 27 on corn; 4 characters) 
Federer, W. T., Biometrics 5:1hh, 1949. 

(у = eT ks 5, r - h on corn uniformity data) 


›‚ Cornell Univ. Agr. Exp. Sta. Memoir 299, 1950. 


(у = 3 "verieties" on corn uniformity trial data in blocks of k=3withr=4} 
and with r = 6 replicates) 
— — , and Robson, D. S., Cornell Univ. Agr. Exp. Sta. Memoir 309, 1952. 
(v= 2? "varieties" on corn uniformity trial data in blocks of k = 4 with r = 6 
replicates) 
Galinat, W. C., and Everett, Н. L., Agron. J. 11:15, 1949. 
verieties of sweet corn, k = 3, judges compared sweet corn for 3 varieties 
9 
at a time, г = }) 


Goulden, C. H., Sci. Agr. 25:115, 19. 

= 25, k 2 5, г = h, quadruple lattice; k?- 16, k=}, р = 1, simple 
lattice; КЁ = 25, k = 5, г = 6, balanced lattice) 
Harshbarger, B., Ann. Math. Stat. 15:507, 19hh; 16:587, 1945. 
(6x6 lattice in 4 Groups, г = 8; corn) 
› Va. Agr. Exp. Sta. Memoir 1:1, 1947. 
(6x5 simple rectangular lattice in 6 replicates) 
» Biometrics 5:1, 1949. 
(4x3 triple rectangular lattice, 6 replicates; alfalfa varieties) 
Homeyer, P. G., etal., Iowa Agr. Exp. Sta. Res. Bul. 347, 1947. 
(Triple lattice, k? = 81, r= 6, corn; simple lattice, k? = Gh, r = 4, corn; 
balanced lattice, k? = 25, r 6, corn; cubic lattice, к = 64, r = 9, cotton) 
James, E., and Bancroft, T. A., Agron. J. 45:96, 1951. 


(2? factorial in a balanced inc 
blocks of size k - 2) 


Nair, K. R., Biometrics 8:122, 1952. 
(2 associate class p.b.i.b. design = simple lattice with ү = 25 and r = 2) 

› end Mahalanobis, P. C., Indian J. Agr. Sci. 10:665, 190. 
(100 varieties of rice in simple lattice in 4 replicates but replicates were 
not together in one area as is the usual case) 
Robinson, H. F., and Watson, G. S., N.C. Agr. Exp. Sta. Tech. Bul. 88, 1949. 
(7х8 simple e lattice in 6 replicates, cotton; 7x6 triple rectangu- 
lar lattice in replicates, corn; estimation of missi values i: o 
quiin ng 5 in rectangu 


Wellhausen, E. J., J. Am. Soc. 


Agron. 35:66, 1943, 
(3x3 balanced lattice in 4 replicates on corn) 


Yates, F., J. Agr. Sci. 26:42}, 1936, 
(TxT simple lattice ana h? cubic lattice on orange tree uniformity data without 
recovery of inter-block information) 


» Ann. Eugenics 1:519, 1937. 
(5x5 lattice square in 5 replicates on orange tree uniformity data without re- 
covery of inter-block information) 


7——, Ann. Eugenics 9:156, 1959. 


omplete design of Т replicates with incomplete 


= & wher to n 


ý 


LITERATURE CITATIONS 35 


(5? cubic lattice with no complete replicates on ori 
with recovery of inter-block information) 

» J. Agr. Sci. 30:672, 1940. 

(5x5 lattice square in 5 replicates on sugar beets) 
Zuber, M. S., J. Am. Soc. Agron. 34:30, 1942. 

(5x5 lattice square in 3 replicates on corn uniformity data) 


ange tree uniformity data 


In addition to the above references, numerical examples may be found in various 


textbooks, e.g., Anderson and Bancroft, Statistical Theory in Research (ch. 19); 
Cochran and Cox, Experimental Designs (ch. 10); Goulden, Methods of Statistical 
Analysis (pp. 184, 189, 195, 207); Hayes апа Immer, Methods of Plant Breeding 

(ch. XXII); Leonard and Clark, Field Plot Technique (ch. XX); and Love, Experi- 
mental Methods in Agricultural Research (». 128). 


CHAPTER XII - LATTICE DESIGNS WITH MORE THAN ONE RESTRICTION ON THE ALLOCATION 
OF TREATMENTS IN THE COMPLETE BLOCK 


PROBLEMS 


1. An experiment was set up to test 7 varieties of corn with 7 different 5рас- 
ings. Given that + replicates are to be used give the possible experimental 
designs with a key-out of the degrees of freedom for each Gesign. Discuss 
briefly the applicability of each experimental design listed. 


2. An experimenter desires to compare 119 varieties ага either 2 or 6 check 


varieties. From previous experience it is known that the coefficient of varia- 


tion from experiments designed as randomized complete blocks is usually between 
7 and 11 percent. What type of design would you recommend? Why? If the co- 


efficient of variation were 25 percent which design or designs would probably 
be the most appropriate? Why? 


3. Complete the analysis for the data from replicates III and IV in table 
XII-3. 


4k. Complete the analysis for replicates I, II, and III in table XII-3. 


5. Complete the analysis for replicates IV, V, and VI in table XII-16 by both 


methods; i.e., by the method of Pseudo-effects and by the method of row and 
colum totals. 


6. Using the results in chapter IX complete the analysis for replicates I, її, 
III, and IV in table XII-5. Show arithmetically and algebraically that the 4 
row adjustments are equivalent to the single adjustment AL, 


and show that the 
4 column adjustments are equivalent to yM', 
LITERATURE CITATIONS 


Bliss, C. I., anà Dearborn, R. B., Proc. Am. 
(5x5 lattice square in 


Soc. Hort. Sci, 41:324, 1942. 
3 replicates on corn; ie 
adjusted means) 


missing plot formula; F test of 


Boyd, L. L., Unpubl. M. S. thesis, 


Iowa State Col 5 
(7x7 lattice square, 8 e College, 1948 


replicates оп h9 types of nails in blocks of wood) 


Towa Agr. Exp. Sta. Res. Bul. 318:731 1943. 
(7х7 ий; sql an 8x8 Па 126 square in } 


LITERATURE CITATIONS 3T 


Cornish, E. A., J. Australian Inst. Agr. Sci. 7:19, 19%1. 

(Estimates a missing row or column or treatment in a lattice square; Yates! 5x5 
example) 

, Council Sci. and Ind. Res. Bul. (Australia) No. 175, 1944. 

(Recovery of inter-block information in lattice square experiments with incom- 
plete data) ` 

Day, B., and Austin, L., J. Agr- Res. 59:101, 1939. 

(Plan for К? = 729 entries in k blocks of k of k in 5 replicates) 


Federer, W. T., Unpubl. results, 1949. 

(kxk lattice square with split plots of k plots in 5 anà 6 replicates; k = 2 
and 5 on corn uniformity data) 

» Biometrics 6:34, 1950. 

(6x6 incomplete lattice square with 3 and 6 replicates on corn uniformity data) 


Green, J. M., Unpubl. Ph. D. thesis, Iowa State College, 1947. 
(512 varieties of corn in 8 blocks of 8 split into 8 varieties in 6 replicates) 


Homeyer, P. G., et al., Iowa Agr. Exp. Sta. Res. Bul. 347, 1947. 
(Lattice squares with К = 121, k = 11, r = б, and k? = 25, k = 5, г = 6) 


Kempthorne, O., J. Agr. Sci. 57:156, 1947. 

(5x5 lattice square with 5 replicates; 22 treatments plus 5 controls on pota- 
toes; each plot split into 4 plots with either n, p, К, or npk, or (1), np, 
nk, pk on the ! plots) 


Weiss, M. G., and Cox, G. M., Iowa Agr. Exp. Sta. Res. Bul. 257:291, 1959. 
(31 soybean varieties, К = б, г = 6, b = 31; 49 soybean varieties in a lattice 
square of 4 replicates; neither analysis recovers inter-block information) 


In addition to the above references, numerical examples may be found in various 
textbooks; e.g., Cochran and Cox, Experimental Designs (ch. 12); апа Goulden, 
Methods of Statistical Analysis, 2nd ed. (p. 295). 


CHAPTER XIII - OTHER INCOMPLETE BLOCK DESIGNS 
PROBLEMS 


1. For the 10 treatments in example XIII-1 treatments 3, 4, 5, 6, 7, 8, 9, and 
10 are denoted as 000, 001, O10, O11, 100, 101, 110, and 111, respectively. 
Partition the adjusted treatment sum of Squares into the following contrasts: 


Contrast af Contrast аг 

CER U 

G (powder) 1 GP 1 

M (moisture) 1 MP 1 

GM 1 GMP l 

P (pack) = 2 set versus checks ak 
Checks 1 


Test the hypothesis of no effect for each contrast. What other set of contrasts 


is indicated by the selection of the Particular 10 treatments (see the original 
Paper)? Obtain the sums of squares for the various contrasts. Give your inter- 
pretation for both analyses. 

2. For the data of example XIII- 


1 obtain the treatment means adjusted for 
intra&block information only. 


Compute the appropriate standard error of э mean 
and the standard error of a difference between two means adjusted without re- 
covery of interblock information. 


How many degrees of freedom are associated 
with these standard errors? 
3. Partition the sum of Squares for treatments 
XIII-2 into 8 individual comparisons, each with 
interpretation of the results 


(eliminating block) in example 
one degree of freedom. Give an 


4, 


Given that the linear model for the desi 
X 


doc P * E ToO EY. + 
effect, т 


En in section XIII-h.] is 
Bs oy where u = mean effect, p 


p whole plot or cross effect, в, j = random error deviation associated 


7 effect of hth line 9r selection from ith cross, 
random error deviation associated with 1 


121,8,... g; andh = Чуд Gees k; 
cross i has nothing in common with h 
standard errors given in formulae (XI 


js replicate 


with whole plots, Yin 


Pijn * 
21,2, e, Tj 

h is dependent upon i since h = 1 in 

= lin cross i! for ії #4; 
11-19) to (XIII-81). 


dhth split Plot; j 
derive the 
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5. Ignore the fact that the data in table XI-18 are in replicates, and assume 
that the groups of treatments were randomly allotted to the 12 incomplete 
blocks. Complete the analysis for the data as described in example ХІІІ-1. 


6. Assume that there is no grouping of the blocks into compact replicates in 
table XI-1h. Complete the analysis as described in example XIII-2. 


LITERATURE CITATIONS 


Bliss, C. I., Biometrics 3:69, 1947. 

(у = ke), k = 5 times of injections рег day, b = 4 dogs, r = 5 repeated 5 
times; balanced incomplete block lattice; v = ц treatments, К = 2 times of 
injection, b = б dogs, 2 groups of 6 dogs with the whole set repeated 5 times) 
Сарб, B. G., J. Agr. Univ. Puerto Rico 28:22, 19ll. 

(v = 8, k 22, b = 27) 


Cornish, E. A., Ann. Eugenics 10:112, 1940. 
(v = 9, Ъ = 18, г = 8, A= 5, with 5 missing values) 


J. Australian Inst. Agr. Sci. 6:31, 19:0. 
(Estimates missing blocks and missing treatment for balanced incomplete block 
designs and for Youden square; v = 9, К = 4, X= 5) 


Moore, W., and Bliss, C. I., J. Econ. Ent. 35:544, 1942. 
(у = 1, ге 5, b 2 T, К = 5 with split plots of 4 concentrations of chemicals) 


Rao, C. R., J. Am. Stat. Assoc. 42:541, 1947. 
(Partially balanced incomplete block, v = 20, b = 16, k = 5, r = 4) 


Van Rest, E. D., J. Roy. Stat. Soc., Suppl. 4:184, 1937. 
(No data but gives experimental plans for 4 treatments in blocks of 2 where 
the incomplete blocks are halves of a board) 


Wishart, J., Imp. Bur. Plant Br. and Genetics, 1940. 


(21 barley varieties, г = 5, b = 21, К = 5 without recovery of interblock in- 
formation) 


Yates, F., Ann. Eugenics 7:121, 1936. 

(у = 5, К =2, Ъ = 50, г = 12; 6 plants with 5 leaves each and leaves divided 
into left and right portions; v = 7, b = 7 litters of rats, k = 4 = litter 
size, in 4 replicates) 


, Ann. Eugenics 10:317, 1940. 
(у = 9, r= 8, k = 4, Ъ = 18, № = 5 on rat uniformity data; v = 21 tomato var- 
ieties, k = 5, r2 5, b = 21, А = 1) 


Youden, W. J., Contr. Boyce Thompson Inst. 9:41, 1937; 11:207, 219, 

1940. 

(у = 21, k = 5, b = 21 on tobacco; v = 11 talc powders on wheat, k = 6, r = 6, 
b = 1l; v = 21, k = 5, r = 5, b = 21) 


In addition to the above references, numerical examples may be found in various 
textbooks; e.g., Anderson and Bancroft, Statistical Theory in Research (ch. 19); 
Cochran and Cox, Experimental Designs (ch. 11, 13); Goulden, Methods of Statis- 
tical Analysis (pp. 196, 208); Kempthorne, The Design and Analysis of Experi- 


ments (sec. 27.5); and Love, rimental Methods of Agricultural Research (p. 
121). 


CHAPTER XIV - BALANCED DESIGNS 
PROBLEMS 


1. Complete the computations suggested in examples 15.22 and 15.23, Snedecor, 


G. W., Statistical Methods, 1946, and discuss the results of the two experiments 
involved. 


2. Complete the design for the 24 cycles suggested by Cochran, Autrey, and 
Cannon, J. Dairy Sci. 24:949, 1941. 


3. Complete the analysis of the following data (plant height in cm. of tobacco 
plants for two different dosages of cathode rays on tobacco seed; averages of 


20 plants; from an experiment conducted by H. H. Smith, Cornell Univ., 1950, 
unpublished results): 


Replicate number 
2 3 4 5 6 T 


Better C-50 C-66 G-ho C-59 б-%1 G-52 С-18 G-47 
Poorer С-ы G-h9 C-h9 G-30 C-h9 C-h3 G-hs 0-59 


Total for treatment C = 423; total for treatment G = 346. 


4. Complete the analysis on the other half of the plots for the experiment 


described in example XIV-5. Also, run а similar analysis on the yields of 


wheat, peas, and mangolds. Run a combined analysis on all 4 crops. Do you 
agree with the changes made in the experiment? 


5. Compute the standard errors for the various differences and totals in table 
XIV-3. Partition the treatment sum of squares into the comparisons A vs B, 
C vs D, andA+BvsC+pD, 


The latter contrast compares organic with inorganic 
manures. Discuss the results. 


LITERATURE CITATIONS TO CHANGE OVER EXPERIMENTS 


Blosser, T. H., and Smith, V. R., J. Dairy Sci. 30:951, 1947. 
(2 numerical exemples of reversal or switch back designs, 2 treatments) 
Braude, R., J. Dairy Sci. 37:45, 1947. 


(3 pairs of 2 pigs іп a switch-over design) 
Brandt, A. E., Iowa Agr. Exp. Sta. Res. Bul. 


234:60, 1958. 
(3 numerical examples of switch back or reversal Gey aa 
ю 


LITERATURE CITATIONS hi 
Brouwer, E., and Dijkstra, N. D., J. Agr. Sci. 28:695, 1938. 
(2 groups of 4 cows each in a change over design) 


Cannon, C. Y., et el., Iowa Agr. Exp. Sta. Res. Bul. 292:101, 1952. 
(l periods, 2 lots of cows and 2 treatments іп a double reversal) 


Cochran, W. G., etal., J. Dairy Sci. 24:937, 1941. 
(Double change over on 18 cows for 5 periods; some missing values) 


Donald, H. P., Emp. J. Exp. Agr. 7:32, 1959. 
(Switch back, 2 periods, 2 treatments on pigs) 


Johnson, B. C., et al., J. Animal Sci. 1:236, 1942. 
(5 numerical examples of cross over designs on sheep) 


Loosli, J. K., and Lucas, H. L., J. Dairy Sci. 26:291, 1943. 
(Double reversal design, 4 Holstein cows and 3 periods) 

, etal., Cornell Univ. Agr. Exp. Sta. Memoir 265, 1944. 

(4 periods and 12 cows in 5 xh l.s. in е double change over design) 


Seath, D. M., J. Dairy Sci. 27:159, 19hh. 

(4 treatments, 3 periods, 4 groups of 5 cows in a double reversal) 
Watson, S. J., and Ferguson, W. S., J. Agr. Sci. 26:189, 1936. 
(Change over design, 2 treatments, l periods) 


Youden, W. J., and Beale, Н. P., Contr. Boyce Thompson Inst. 6:437, 195%. 
(2 treatments, 8 plants, 6 leaf positions, 2 halves of leaf) 


LITERATURE CITATIONS TO ROTATION EXPERIMENTS 


Cochran, W. G., J. Roy. Stat. Soc., Suppl. 6:104, 1939. 
(4 course rotation on l crops with each crop whole plot split into 4 split 
plots of fertilizers; also a 3 course and 6 course rotation example plus 


several plans) 
Cook, R. L., etal., Soil Sci. Soc. Am., Proc. 10:215, 1945. 
(Numerical examples of seven 5-year rotations іп ! replicates) 


Finney, D. J., Emp. J. Exp. Agr. 8:111, 1940. 
(Systematic design ага frequent changes of treatments with an analysis) 


Mahalanobis, P. C., Indian J. Agr. Sci. 4:361, 195h. 

(Rotation experiment on cotton, groundnut, and juar; systematic scheme with an 
analysis; also some plans) 

Menon, T. V. Gey апа Bose, S. S., Indian J. Agr. Sci. 7:195, 1957. 

(4 course rotation applied in two years with 11 cycles) 


Rothamsted Exp. Sta. Reports, 1930-1956. 

(Several examples) 

Russell, E. J., etal., Fifty years of fielà experiments at the Woburn Exp. 
Sta., Longmans, Green and Co., London, 1936, p. 196. 

(Complete details of example XIV-3) 

Smith, б. E., Mo. Agr. Exp; Ste. Bul. 458:1, 1942. 
- (Data for various rotations) 


Wood, R. C., Trop. Agr. 15:187, 1958. 
(Yields from various crops in l replicates for years 1930-1938) 


СНАРТЕВ XVI - COVARIANCE 
PROBLEMS 


1. The following artificial example of a 


3x3 latin Square design has been 
constructed for ease of computation: 


6 АТ 5 C29 
A16 15 C25 10 B16 8 


C2} 15 B18 15 A 12 
65 36 бо зо 57 2h 


(А, B, с) means; graph the results; 


2. 


Previous example. Compute the missi 
Using the methoa described in exampl 
the results. 


(F) 51.1 12 
(с) 02.5 iœ 
(D) 45.7 100 


Ese mg 
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4, Data on resin percentage and shrub weight (grams) of plants were obtained 
for the varieties and replicates of example V-2. Compare the varieties for 
resin percentage corrected for shrub weight. Discuss the results with regard 
to transformation of the data, applicability of a covariance analysis, suite- 
bility of the regression used, and differences among varieties. 

Plot number, variety number, resin Percentage, and shrub weight of 


2 randomly selected plants from each of 7 guayule varieties grown 
in five randomized complete blocks 


4-109 2-05 1-407 
3.97- 34 4.49- 88 | 5.74- 39 
6.17-105 5.56-1%5 | 5.71-102 

10-405 11-406 13-130 

5.15- 28 | 4.86- 67 5-49-15 

5.10-1h6 | 5.65-110 5.5%-106 
21-595 19-06 16-405 15-109 
5.88- 58 | А. 5.82-125 | 4, 6.22-111 | 6.05- 87 
5.59-101 5 .20-112 4.85- 15 | 6.04- 84 
22-130 23-109 24-405 25-416 26-593 27-407 28-406 
6.13- 70 | 5.6h-101 7.24-115 | 6.30-101 5.36- 99 | 6.20-127 5.17-10% 
5.63-116 | 6.65- 12 | 5.95-170 | 5.75- 15 5.21-105 | 6.74-19 | 6.62-117 

25-595 33-416 31-406 30-109 


5.80- 93 А 5.85- 8 d 586-144 | 5.85-105 
6.15- 73 . 5.75- 55 . 5.75- 98 | 5.89- 88 


8 First number = plot number; second number = varietal designation, 
апа last two pairs of numbers equal percentage of resin ага dry 
weight of shrub from the two plants. 


5. The following data on leaf width (total in em. for 20 plants) on tobacco 
plants are from the experiment described by Federer and Schlottfeldt, 
Biometrics 10:282, 1954. Complete the analysis of these data in the manner 
described by the above authors. Compute the gain in efficiency of the 
covariance analysis over the variance analysis using formula (I-1). 

Compare your result with that obtained in the original paper. Compute the 
standard error of a difference between the adjusted totals for A and F. 


Replicate number 


[sr | > шш evo p ie RR 
F B A E F G B D 


561.6 
в 
307.7 
А 
325.7 
e 
515.5 
F 
265.8 
G 
418.8) 414.8| 384.9] 381.0] 358.1 369.1] 365.2] 381.2 


LITERATURE CITATIONS 


Bartlett, M. S., J. Agr. Sci. 25:238, 1955. 
(Multiple covariance in a c.r. design) 


Brady, J., J. Roy. Stat. Soc., Suppl. 2:99, 1935. 
(Multiple covariance ina 9x9 1.з. with а 5х5 factorial) 


ompact replicates for 
and for a cubic lattice) ; a 


Cox, G. M., et al., Iowa Agr. Exp. Sta. Res. Bul. 281, 1909. * 
(Соуагіапсе for а triple lattice design) 


Day, B., and Fisher, В. A., Ann. Eugenics 1:235, 1937. 
(Multiple covariance in a c.r. design) 


DeLury, D. B., Biometrics 4:153, 1948. 
(Covariance with a 3x4 factorial) 


Federer, М. T., and Schlottfeldt, C. 5. Biometrics 10:282, 1954 
(Multiple covariance in a r.c.b. design) [ 


Fieller, E. C., J. Roy. Stat. Soc., Suppl. 7:1, 19h0. 
Covariance for a change over design) 


Green, J. M., Unpubl. Ph. D. thesis, Iowa State Colle 
(Covariance for a cubic lattice) ? ве, 1947, 


Li, H. W., et al., J. Am. Soc. Agron, 28:1, 1956. 
(Covariance in а 1.5. design) 


Mahoney, C. H., anà Baten, W. D.; 


J. Agr. Res. 58:317, 1 
(Covariance and missing plots in г T, 1939. 


‘c.b. designs; covariance in a 1.5. design) 
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O'Neil, J. B., Sci. Agr. 22:721, 1942. 
(Covariance for a 2x2x3 factorial) 


Quenouille, M. H., Biometrics 4:2h0, 1948. 
(Use of covariance with unequal numbers analyses) 


Robinson, H. F., and Watson, G. S., М. C. Agr. Exp. Sta. Tech. Bul. 88, 1949 
(Covariance analyses for rectangular lattices) 


Smith, H. F., J. Indian Soc. Agr. Stat. 2:111, 1950. Е 
(Covariance in analysis of missing and damaged observations in a 1.з.) 


In addition to the above citations, examples of the analysis of covariance 
may be found in most texts on Statistics; e.g., Fisher, Statistical Methods 

ptatistical Methods 
for Research Workers (ch. VIII); Snedecor, Statistical Methods (ch. 12, 13); 
pint А 21у een Methods 


Wishart, Field Trials II. The Analysis of Covariance; etc. 
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